HUMAN RECEPTOR MOLECULES
TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human receptor
molecules and to the use of these sequences in the diagnosis, treatment, and prevention of
neoplastic, immunological, reproductive, gastrointestinal, nervous, smooth muscle, and
musculoskeletal disorders.

BACKGROUND OF THE INVENTION 

The term receptor describes proteins that specifically recognize other molecules.
The category is broad and includes proteins with a variety of functions. The bulk of the
proteins termed receptors are cell surface proteins which, when they bind extracellular
ligands, produce cellular responses in the areas of growth, differentiation, endocytosis, and
immune response. Other receptors facilitate the specific transport of proteins out of the
endoplasmic reticulum and localize enzymes to a particular location in the cell. The term
may also be applied to proteins which act as receptors for ligands with known or unknown
chemical composition which interact with other cellular components. For example, the
steroid hormone receptors bind to and regulate transcription of genomic DNA.

Regulation of cell proliferation, differentiation, and migration is important for the
formation and function of tissues. Secreted regulatory proteins such as growth factors
coordinately control these cellular processes and act as mediators in cell-cell signaling
pathways. Growth factors are secreted from the cell, and they bind to specific cell-surface
receptors on target cells. The bound receptors trigger intracellular signal transduction
pathways which activate various downstream effectors that regulate gene expression, cell
division, cell differentiation, cell motility, and other cellular processes.

Cell surface receptors are typically integral membrane proteins of the plasma
membrane. These receptors recognize hormones such as catecholamines; peptide
hormones; growth and differentiation factors; small peptide factors; galanin, somatostatin,
and tachykinins; and circulatory system-borne signaling molecules. Cell surface receptors
on immune system cells recognize antigens, antibodies, and major histocompatibility
complex (MHC)-bound peptides. Other cell surface receptors bind ligands to be
internalized by the cell. This receptor-mediated endocytosis functions in the uptake of low
density lipoproteins (LDL), transferrin, glucose- or mannose-terminal glycoproteins,
galactose-terminal glycoproteins, immunoglobulins, phosphovitellogenins, fibrin,
proteinase-inhibitor complexes, plasminogen activators, and thrombospondin (Lodish, H.
et al. (1995) Molecular Cell Biology, Scientific American Books, New York NY, p. 723;
and Mikhailenko, I. et al. (1997) J. Biol. Chem. 272:6784-6791).

Signal transduction is one process by which cells respond to extracellular signals
(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of
biochemical reactions. The process begins with the binding of the signal molecule to a
cell membrane receptor and ends with the activation of an intracellular target molecule.
Such processes regulate many cell functions including cell proliferation, differentiation,
gene transcription, and oncogenic transformation.

Many growth factor receptors, including those for epidermal growth factor, platelet-derived
growth factor, and fibroblast growth factor, contain intrinsic protein kinase activities.
When the polypeptide growth factor binds to the receptor, it triggers the
autophosphorylation of a tyrosine residue on the receptor. It is believed that these
phosphorylated sites are recognition sites for the binding of other cytoplasmic signaling
proteins in the signaling pathway that eventually links the initial receptor activation at the
cell surface to the activation of a specific intracellular target molecule. These signaling
proteins contain a common domain referred to as a src homology 2 (SH2) domain. SH2
domains are found in a variety of signaling molecules and oncogenic proteins such as
phospholipase C-γ, Ras GTPase activating protein, and pp60c-src (Lowenstein, E.J. et al.
(1992) Cell 70:431-42).

Epidermal growth factor (EGF) is a mitogen that stimulates the proliferation of
epithelial tissue. In addition, some EGF-related proteins act as inductive signals in the
differentiation of embryonic tissue. Proteins belonging to the EGF family share a
conserved, repeated motif of about 40 amino acids with a characteristic distribution of
cysteine residues (Nicola, N. A. (1994) Guidebook to Cytokines and Their Receptors,
Oxford University Press, New York, NY, pages 194-197). These EGF motifs are also
found in numerous proteins outside the EGF family, particularly in extracellular proteins
important for various aspects of cell-cell signaling and recognition.

Extracellular stimuli which induce early response genes include growth factors,
phorbol esters, okadaic acid, protein synthesis inhibitors, toxins, and abrupt changes in
temperature, pH, and oxygen. The stimulus activates cell surface receptors and membrane
bound molecules which initiate signaling cascades that induce the transcription of early
response genes. These early response genes include the genes for cytokines; fos, myc, and jun;
the edg-1 receptor; and nuclear receptors, all of which have roles in cellular proliferation
and differentiation.

Many cell surface receptors have seven transmembrane regions, with an
extracellular N-terminus that binds ligand and a cytoplasmic C-terminus that interacts
with G proteins (Strosberg, A.D. (1991) Eur. J. Biochem. 196:1-10). Such G-protein
coupled receptors (GPCRs) are integral membrane proteins characterized by the presence
of seven hydrophobic transmembrane domains which span the plasma membrane and
form a bundle of antiparallel alpha helices. The transmembrane domains account for
structural and functional features of the receptor. In most cases, the bundle of helices
forms a binding pocket; however, when the binding site must accommodate more bulky
molecules, the extracellular N-terminal segment or one or more of the three extracellular
loops participate in binding and in subsequent induction of conformational change in
intracellular portions of the receptor. The activated receptor, in turn, interacts with an
intracellular heterotrimeric G-protein complex which mediates further intracellular
signaling activities, generally interaction with guanine nucleotide binding (G) proteins and
the production of second messengers such as cyclic AMP (cAMP), phospholipase C,
inositol triphosphate or ion channel proteins (Baldwin, J.M. (1994) Curr. Opin. Cell Biol.
6:180-190).

The amino-terminus of the GPCR is extracellular, of variable length and often
glycosylated; the carboxy-terminus is cytoplasmic and generally phosphorylated.
Extracellular loops of the GPCR alternate with intracellular loops and link the
transmembrane domains. The most conserved domains of GPCRs are the transmembrane
domains and the first two cytoplasmic loops. GPCRs range in size from under 400 to over
1000 amino acids (Coughlin, S.R. (1994) Curr. Opin. Cell Biol. 6:191-197).

GPCRs respond to a diverse array of ligands including lipid analogs, amino acids
and their derivatives, peptides, cytokines, and specialized stimuli such as light, taste, and
odor. GPCRs function in physiological processes including vision (the rhodopsins), smell
(the olfactory receptors), neurotransmission (muscarinic acetylcholine, dopamine, and
adrenergic receptors), and hormonal response (luteinizing hormone and
thyroid-stimulating hormone receptors).

GPCR mutations, which may cause loss of function or constitutive activation, have
been associated with numerous human diseases (Coughlin, supra). For instance, retinitis
pigmentosa may arise from mutations in the rhodopsin gene. Parma, J. et al. (1993, Nature
365:649-651) report that somatic activating mutations in the thyrotropin receptor cause
hyperfunctioning thyroid adenomas and suggest that certain G-protein-coupled receptors
susceptible to constitutive activation may behave as proto-oncogenes.

The frizzled cell surface receptor, originally identified in Drosophila
melanogaster, is important for proper bristle and hair polarity on the wing, leg, thorax,
abdomen, and eye of the developing insect (Wang, Y. et al. (1996) J. Biol. Chem.
271:4468-4476). The frizzled gene encodes a 587 amino acid protein which contains an
N-terminal signal sequence and seven putative transmembrane regions. The cysteine-rich
N-terminus is probably extracellular and the C-terminus is probably cytosolic. Multiple
frizzled gene homologs have been found in rat, mouse, and human. The frizzled receptors
are not homologous to other seven-transmembrane-region receptors and their ligands are
still unknown.

T cells play a dual role in the immune system as effectors and regulators, coupling
antigen recognition and the transmission of signals that induce cell death in infected cells
and stimulate other immune cells. Although T cells recognize a wide range of different
antigens, a particular clonal line of T cells can only recognize a single antigen and only
when it is presented to the T cell receptor (TCR) as a peptide complexed with a major
histocompatibility molecule (MHC) on the surface of an antigen presenting cell. The TCR on
most T cells consists of immunoglobulin-like integral membrane glycoproteins containing
two polypeptide subunits, α and β, of similar molecular weight. The TCR β subunit has an
extracellular domain containing both variable and constant regions, a transmembrane
domain that traverses the membrane once, and a short intracellular domain (Saito, H. et al.
(1984) Nature 309:757-762). The genes for the TCR subunits are constructed through
somatic rearrangement of different gene segments. Interaction of antigen in the proper
MHC context with the TCR initiates signaling cascades that induce the proliferation,
maturation, and function of cellular components of the immune system (Weiss, A. (1991)
Annu. Rev. Genet. 25:487-510). Rearrangements in TCR genes and alterations in TCR
expression have been noted in lymphomas, leukemias, autoimmune disorders, and
immunodeficiency disorders (Aisenberg, A.C. et al. (1985) N. Engl. J. Med. 313:529-533;
Weiss, supra; and Olive, supra).

WO 99/57270 PCT/US99/09191

Other potential membrane-spanning and membrane protein-interacting proteins
with putative receptor function include the mufl protein; MARCO; the transmembrane 4
family (TM4) of proteins; the dopamine, serotonin, and muscarinic receptors; and
prenylated proteins.

Abnormal hormonal secretion is linked to disorders including diabetes insipidus,
hyper- and hypoglycemia, Graves' disease and goiter, and Cushing's and Addison's
diseases. Cancer cells secrete excessive amounts of hormones or other biologically active
peptides. Disorders related to excessive secretion of biologically active peptides by tumor
cells include fasting hypoglycemia due to increased insulin secretion from insulinoma-islet
cell tumors; hypertension due to increased epinephrine and norepinephrine secreted from
pheochromocytomas of the adrenal medulla and sympathetic paraganglia; and carcinoid
syndrome, which includes abdominal cramps, diarrhea, and valvular heart disease, caused
by excessive amounts of vasoactive substances secreted from intestinal tumors. Tumors
may exhibit ectopic synthesis and secretion of biologically active peptides, including
ACTH and vasopressin in lung and pancreatic cancers; parathyroid hormone in lung and
bladder cancers; calcitonin in lung and breast cancers; and thyroid-stimulating hormone in
medullary thyroid carcinoma.

Inflammation is a molecular, cellular, and tissue program during which foreign
substances and pathogens are destroyed, and injured tissue is repaired through a variety of
biochemical, biophysical, and cellular mechanisms. The principal cellular mediators of
inflammation are leukocytes, particularly granulocytes and the monocytes/macrophages.
Macrophages recognize, internalize, and destroy a variety of foreign (non-self) and
endogenous substances and pathogens, including bacteria, parasites, and viruses. The
exact recognition mechanism for non-self pathogens is unknown, but it has been proposed
that receptors with broad binding specificity are used to discriminate between self and
non-self antigens. Macrophages are also thought to play an important role in the immune
response by presenting foreign antigens to lymphocytes.

Steroid hormones regulate many cellular and tissue functions. Progesterone, a
4-pregnene-3,20-dione derived from cholesterol, is a critical oscillating component of the
female reproductive cycle. These oscillations correlate with anatomical and
morphological changes including menstruation and pregnancy.

The activities of progesterone are mediated through the intracellular progesterone
receptor (PR). In the cytoplasm PR associates with several other proteins and factors
termed the PR heterocomplex (PRC). PR is inactive when bound by molecular
chaperones, immunophilins, and the heat shock proteins (hsp70, hsp90, hsp27, and p59
(hsp56), p48, and p23; Johnson, J.L. et al. (1994) Mol. Cell. Biol. 14:1956-1963). Active
PR binds progesterone and translocates to the nucleus where it binds as a transcription
factor to canonical DNA transcriptional elements of progesterone-regulated genes
implicated in differentiation and in the cell cycle (Moutsatsou, P. and Sekeris, C.E. (1997)
Ann. N.Y. Acad. Sci. 816:99-115).

Other non-membrane interacting receptor proteins include the small nuclear
ribosomal proteins, the yeast growth-related SIS2 protein, single-stranded DNA-binding
proteins, RAG-1 activating proteins, and the hamster FAR-17a protein.

The discovery of new human receptor molecules and the polynucleotides encoding
them satisfies a need in the art by providing new compositions which are useful in the
diagnosis, treatment, and prevention of neoplastic, immunological, reproductive,
gastrointestinal, nervous, smooth muscle, and musculoskeletal disorders.

SUMMARY OF THE INVENTION 
The invention features substantially purified polypeptides, human receptor
molecules, referred to collectively as "REC". In one aspect, the invention provides a
substantially purified polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention further provides a substantially purified variant having at least 90%
amino acid identity to the amino acid sequences of SEQ ID NOs:1-16, and fragments
thereof. The invention also provides an isolated and purified polynucleotide encoding the
polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NOs:1-16, and fragments thereof. The invention also includes an isolated and
purified polynucleotide variant having at least 90% polynucleotide sequence identity to the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NOs:1-16, and fragments thereof.
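
The "at least 90% identity" criterion recited above can be illustrated with a short calculation. The following Python sketch is illustrative only. It assumes that the two sequences have already been aligned to equal length with "-" as the gap character, and it counts identity over ungapped columns, which is only one of several conventions in use; the text above does not state which alignment program or identity convention applies. The function names `percent_identity` and `meets_identity_threshold` are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: the inputs are already aligned (equal length, gaps marked).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns (one possible convention)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Ten aligned residues with one mismatch at the last position.
    print(percent_identity("MKTAYIAKQR", "MKTAYIAKQK"))
```

Under this convention a variant differing from a ten-residue reference at one position sits exactly at the threshold; a different treatment of gaps would change the denominator and possibly the outcome.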

Additionally, the invention provides an isolated and purified polynucleotide which
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs:1-16,
and fragments thereof, as well as an isolated and purified polynucleotide having a
sequence which is complementary to the polynucleotide encoding the polypeptide
comprising the amino acid sequence selected from the group consisting of SEQ ID NOs:1-16,
and fragments thereof.
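
The notion of a polynucleotide that is complementary to another can be sketched computationally using Watson-Crick base pairing. The following Python example is a minimal illustration under stated assumptions: sequences are DNA written 5' to 3' using only A, C, G, T, and N, and the function names `reverse_complement` and `is_complementary` are hypothetical. It does not model hybridization or stringent conditions, which depend on temperature, salt, and mismatch tolerance.

```python
# Translation table for Watson-Crick pairing (A-T, C-G); N pairs with N.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")
_ALLOWED = set("ACGTNacgtn")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    invalid = set(dna) - _ALLOWED
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return dna.translate(_COMPLEMENT)[::-1]


def is_complementary(strand_a: str, strand_b: str) -> bool:
    """True if strand_b is the exact reverse complement of strand_a."""
    return strand_a.upper() == reverse_complement(strand_b).upper()


if __name__ == "__main__":
    print(reverse_complement("ATGCCG"))
```

Exact complementarity is the strictest case; hybridization probes in practice tolerate some mismatch, which this sketch deliberately leaves out.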

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NOs:17-32, and
fragments thereof. The invention further provides an isolated and purified polynucleotide
variant having at least 90% polynucleotide sequence identity to the polynucleotide
sequence comprising a polynucleotide sequence selected from the group consisting of
SEQ ID NOs:17-32, and fragments thereof, as well as an isolated and purified
polynucleotide having a sequence which is complementary to the polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID
NOs:17-32, and fragments thereof.

The invention further provides an expression vector containing at least a fragment
of the polynucleotide encoding the polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof. In
another aspect, the expression vector is contained within a host cell.

The invention also provides a method for producing a polypeptide comprising the
amino acid sequence selected from the group consisting of SEQ ID NOs:1-16, and
fragments thereof, the method comprising the steps of: (a) culturing the host cell
containing an expression vector containing at least a fragment of a polynucleotide
encoding the polypeptide under conditions suitable for the expression of the polypeptide;
and (b) recovering the polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a
substantially purified polypeptide having the amino acid sequence selected from the group
consisting of SEQ ID NOs:1-16, and fragments thereof in conjunction with a suitable
pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide
comprising the amino acid sequence selected from the group consisting of SEQ ID NOs:1-16,
and fragments thereof, as well as a purified agonist and a purified antagonist to the
polypeptide.

The invention also provides a method for treating or preventing a neoplastic
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing an immunological
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing a reproductive
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing a gastrointestinal
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing a nervous disorder,
the method comprising administering to a subject in need of such treatment an effective
amount of an antagonist of the polypeptide having an amino acid sequence selected from
the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing a smooth muscle
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for treating or preventing a musculoskeletal
disorder, the method comprising administering to a subject in need of such treatment an
effective amount of an antagonist of the polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NOs:1-16, and fragments thereof.

The invention also provides a method for detecting a polynucleotide encoding the
polypeptide comprising the amino acid sequence selected from the group consisting of
SEQ ID NOs:1-16, and fragments thereof in a biological sample containing nucleic acids,
the method comprising the steps of: (a) hybridizing the complement of the polynucleotide
sequence encoding the polypeptide comprising the amino acid sequence selected from the
group consisting of SEQ ID NOs:1-16, and fragments thereof to at least one of the nucleic
acids of the biological sample, thereby forming a hybridization complex; and (b) detecting
the hybridization complex, wherein the presence of the hybridization complex correlates
with the presence of a polynucleotide encoding the polypeptide in the biological sample.
In one aspect, the nucleic acids of the biological sample are amplified by the polymerase
chain reaction prior to the hybridizing step.

BRIEF DESCRIPTION OF THE TABLES 
The first column of Table 1 shows protein sequence identification numbers, SEQ ID
NOs:1-16. The second column shows the nucleotide sequence identification numbers,
SEQ ID NOs:17-32, of the consensus sequences which encode SEQ ID NOs:1-16. The
third column lists the Incyte Clone ID associated with the sequence identification number.
The fourth column lists the tissue library from which the Incyte Clone was isolated. The
fifth column lists the overlapping and/or extended nucleic acid sequences which were used
to derive the consensus sequences SEQ ID NOs:17-32.

The first column of Table 2 lists the protein sequence identification numbers. The
second column shows the number of amino acids of SEQ ID NOs:1-16. The third column
lists the potential phosphorylation sites available to cAMP- and cGMP-dependent protein
kinases, casein kinase II, protein kinase C, and/or tyrosine kinases present in SEQ ID
NOs:1-16. The fourth column lists the potential N-glycosylation sites present in SEQ ID
NOs:1-16. The fifth column lists any significant protein family signature or
ligand/substrate binding motif present in SEQ ID NOs:1-16. The sixth column names the
GenBank database homolog with highest log-likelihood score of SEQ ID NOs:1-16. The
seventh column describes the method of analysis or algorithm(s) used to identify SEQ ID
NOs:1-16.
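
As an illustration of how potential N-glycosylation sites of the kind listed in Table 2 might be located, the Python sketch below scans a protein sequence for the widely used consensus sequon Asn-X-Ser/Thr, where X is any residue except proline. This is an assumption made for illustration: the text above does not name the pattern or the software used to populate the table, and a sequon match indicates only a potential site, not actual glycosylation. The function name `n_glycosylation_sites` is hypothetical.

```python
import re
from typing import List

# Asn, then any residue except Pro, then Ser or Thr.
# A lookahead is used so that overlapping sequons are all reported.
_SEQUON = re.compile(r"(?=N[^P][ST])")


def n_glycosylation_sites(protein: str) -> List[int]:
    """Return 1-based positions of Asn residues in N-X-S/T sequons (X != Pro)."""
    return [m.start() + 1 for m in _SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Made-up sequence: N-G-S matches, N-P-T is excluded by the proline rule.
    print(n_glycosylation_sites("MNGSAANPTNNTS"))
```

Positions are reported with the first residue numbered 1, matching the usual convention for protein annotation.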

The first column of Table 3 lists the sequence identification number (SEQ ID
NOs:1-16). The second column lists the tissue expression and fraction of tissue which
express SEQ ID NOs:1-16. The third column lists the disease class and fraction of total
diseases that express SEQ ID NOs:1-16. The fourth column lists the vector used to
subclone the cDNA library.

DESCRIPTION OF THE INVENTION 
Before the present proteins, nucleotide sequences, and methods are described, it is
understood that this invention is not limited to the particular methodology, protocols, cell
lines, vectors, and reagents described, as these may vary. It is also to be understood that
the terminology used herein is for the purpose of describing particular embodiments only,
and is not intended to limit the scope of the present invention which will be limited only
by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms
"a," "an," and "the" include plural reference unless the context clearly dictates otherwise.
Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a
reference to "an antibody" is a reference to one or more antibodies and equivalents thereof
known to those skilled in the art, and so forth.

Unless defined otherwise, all technical and scientific terms used herein have the
same meanings as commonly understood by one of ordinary skill in the art to which this
invention belongs. Although any methods and materials similar or equivalent to those
described herein can be used in the practice or testing of the present invention, the
preferred methods, devices, and materials are now described. All publications mentioned
herein are cited for the purpose of describing and disclosing the cell lines, vectors, and
methodologies which are reported in the publications and which might be used in
connection with the invention. Nothing herein is to be construed as an admission that the
invention is not entitled to antedate such disclosure by virtue of prior invention.

DEFINITIONS

"REC" refers to the amino acid sequences of substantially purified REC obtained
from any species, particularly a mammalian species, including bovine, ovine, porcine,
murine, equine, and preferably the human species, from any source, whether natural,
synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which, when bound to REC, increases or
prolongs the duration of the effect of REC. Agonists may include proteins, nucleic acids,
carbohydrates, or any other molecules which bind to and modulate the effect of REC.

An "allelic variant" refers to an alternative form of the gene encoding REC.
Allelic variants may result from at least one mutation in the nucleic acid sequence and
may result in altered mRNAs or in polypeptides whose structure or function may or may
not be altered. Any given natural or recombinant gene may have none, one, or many
allelic forms. Common mutational changes which give rise to allelic variants are generally
ascribed to natural deletions, additions, or substitutions. Each of these
types of changes may occur alone, or in combination with the others, one or more times in
a given sequence.

"Altered" nucleic acid sequences encoding REC include those sequences with
deletions, insertions, or substitutions of different nucleotides, resulting in a polynucleotide
that encodes the same REC or a polypeptide with at least one functional characteristic of REC.
Included within this definition are polymorphisms which may or may not be readily
detectable using a particular oligonucleotide probe of the polynucleotide encoding REC,
and improper or unexpected hybridization to allelic variants, with a locus other than the
normal chromosomal locus for the polynucleotide sequence encoding REC. The encoded
protein may also be "altered," and may contain deletions, insertions, or substitutions of
amino acid residues which produce a silent change and result in a functionally equivalent
REC. Deliberate amino acid substitutions may be made on the basis of similarity in
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature
of the residues, as long as the biological or immunological activity of REC is retained.
For example, negatively charged amino acids may include aspartic acid and glutamic acid,
positively charged amino acids may include lysine and arginine, and amino acids with
uncharged polar head groups having similar hydrophilicity values may include leucine,
isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine and
threonine; and phenylalanine and tyrosine.

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring
or synthetic molecules. In this context, "fragments," "immunogenic fragments," or
"antigenic fragments" refer to fragments of REC which are preferably about 5 to about 20
amino acids in length, most preferably 15 amino acids, and which retain some biological
activity or immunological activity of REC.

"Amplification" relates to the production of additional copies of a nucleic acid
sequence. Amplification is generally carried out using polymerase chain reaction (PCR)
technologies well known in the art.

"Antagonist" refers to a molecule which, when bound to REC, decreases the
amount or the duration of the effect of the biological or immunological activity of REC.
Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other
molecules which decrease the effect of REC.

"Antibody" refers to intact molecules as well as to fragments thereof, such as Fab,
F(ab')2, and Fv fragments, which are capable of binding the epitopic determinant.
Antibodies that bind REC polypeptides can be prepared using intact polypeptides or using
fragments containing small peptides of interest as the immunizing antigen. The
polypeptide or oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit)
can be derived from the translation of RNA, or synthesized chemically, and can be
conjugated to a carrier protein if desired. Commonly used carriers that are chemically
coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole limpet
hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

"Antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is
used to immunize a host animal, numerous regions of the protein may induce the
production of antibodies which bind specifically to antigenic determinants (given regions
or three-dimensional structures on the protein). An antigenic determinant may compete
with the intact antigen (i.e., the immunogen used to elicit the immune response) for
binding to an antibody.

"Antisense" refers to any composition containing a nucleic acid sequence which is
complementary to the "sense" strand of a specific nucleic acid sequence. Antisense
molecules may be produced by any method including synthesis or transcription. Once
introduced into a cell, the complementary nucleotides combine with natural sequences
produced by the cell to form duplexes and to block either transcription or translation. The
designation "negative" can refer to the antisense strand, and the designation "positive" can
refer to the sense strand.

"Biologically active" refers to a protein having structural, regulatory, or
biochemical functions of a naturally occurring molecule. Likewise, "immunologically
active" refers to the capability of the natural, recombinant, or synthetic REC, or of any
oligopeptide thereof, to induce a specific immune response in appropriate animals or cells
and to bind with specific antibodies.

A "composition comprising a given polynucleotide sequence" or a "composition
comprising a given amino acid sequence" refer broadly to any composition containing the
given polynucleotide or amino acid sequence. The composition may comprise a dry
formulation, an aqueous solution, or a sterile composition. Compositions comprising
polynucleotide sequences encoding REC or fragments of REC may be employed as
hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe
may be deployed in an aqueous solution containing salts, e.g., NaCl, detergents, e.g.,
sodium dodecyl sulfate (SDS), and other components, e.g., Denhardt's solution, dry milk,
salmon sperm DNA, etc.

"Consensus sequence" refers to a nucleic acid sequence which has been
resequenced to resolve uncalled bases, extended using XL-PCR (Perkin Elmer, Norwalk,
CT) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from
the overlapping sequences of more than one Incyte Clone using a computer program for
fragment assembly, such as the GELVIEW Fragment Assembly system (GCG, Madison,
WI). Some sequences have been both extended and assembled to produce the consensus
sequence.

The phrase "correlates with expression of a polynucleotide" indicates that the
detection of the presence of nucleic acids, the same or related to a nucleic acid sequence
encoding REC, by Northern analysis is indicative of the presence of nucleic acids
encoding REC in a sample, and thereby correlates with expression of the transcript from
the polynucleotide encoding REC.

A "deletion" refers to a change in the amino acid or nucleotide sequence that
results in the absence of one or more amino acid residues or nucleotides.

"Derivative" refers to the chemical modification of a polypeptide sequence, or a
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can
include, for example, replacement of hydrogen by an alkyl, acyl, or amino group. A
derivative polynucleotide encodes a polypeptide which retains at least one biological or
immunological function of the natural molecule. A derivative polypeptide is one modified
by glycosylation, pegylation, or any similar process that retains at least one biological or
immunological function of the polypeptide from which it was derived.

"Similarity" refers to a degree of complementarity. There may be partial similarity
or complete similarity. The word "identity" may substitute for the word "similarity." A
partially complementary sequence that at least partially inhibits an identical sequence from
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition
of hybridization of the completely complementary sequence to the target sequence may be
examined using a hybridization assay (Southern or Northern blot, solution hybridization,
and the like) under conditions of reduced stringency. A substantially similar sequence or
hybridization probe will compete for and inhibit the binding of a completely similar
(identical) sequence to the target sequence under conditions of reduced stringency. This is
not to say that conditions of reduced stringency are such that non-specific binding is
permitted, as reduced stringency conditions require that the binding of two sequences to
one another be a specific (i.e., a selective) interaction. The absence of non-specific
binding may be tested by the use of a second target sequence which lacks even a partial
degree of complementarity (e.g., less than about 30% similarity or identity). In the
absence of non-specific binding, the substantially similar sequence or probe will not
hybridize to the second non-complementary target sequence.

The phrases "percent identity" or "% identity" refer to the percentage of sequence
similarity found in a comparison of two or more amino acid or nucleic acid sequences.
Percent identity can be determined electronically, e.g., by using the MEGALIGN program
(DNASTAR, Inc., Madison WI). The MEGALIGN program can create alignments
between two or more sequences according to different methods, e.g., the clustal method.
(See, e.g., Higgins, D.G. and P.M. Sharp (1988) Gene 73:237-244.) The clustal algorithm
groups sequences into clusters by examining the distances between all pairs. The clusters
are aligned pairwise and then in groups. The percentage similarity between two amino
acid sequences, e.g., sequence A and sequence B, is calculated by dividing the length of
sequence A, minus the number of gap residues in sequence A, minus the number of gap
residues in sequence B, into the sum of the residue matches between sequence A and
sequence B, times one hundred. Gaps of low or of no similarity between the two amino
acid sequences are not included in determining percentage similarity. Percent identity
between nucleic acid sequences can also be counted or calculated by other methods known
in the art, e.g., the Jotun Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol.
183:626-645.) Identity between sequences can also be determined by other methods
known in the art, e.g., by varying hybridization conditions.
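
The percentage similarity calculation stated above can be sketched as follows. This is one reading of the text, not the MEGALIGN implementation: it assumes the two sequences are already aligned to equal length, that "-" marks a gap residue, and that "the length of sequence A" means the aligned length. The function name and the example sequences are hypothetical.

```python
def percent_similarity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Matches divided by (aligned length of A - gaps in A - gaps in B), times 100."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # A match is a column where both sequences carry the same non-gap residue.
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap)
    denominator = len(aligned_a) - aligned_a.count(gap) - aligned_b.count(gap)
    if denominator <= 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / denominator


# Hypothetical aligned fragments: 5 matching residues over 6 ungapped columns.
seq_a = "MKT-LLVA"
seq_b = "MRTALL-A"
print(round(percent_similarity(seq_a, seq_b), 1))  # 83.3
```

Under this reading, gapped columns are removed from the denominator, which matches the statement that gaps are not included in determining percentage similarity.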

"Human artificial chromosomes" (HACs) are linear microchromosomes which may
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the
elements required for stable mitotic chromosome segregation and maintenance. (See, e.g.,
Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.)

A "humanized antibody" refers to antibody molecules in which the amino acid
sequence in the non-antigen binding regions has been altered so that the antibody more
closely resembles a human antibody, and still retains its original binding ability.

"Hybridization" refers to any process by which a strand of nucleic acid binds with a
complementary strand through base pairing.

"Hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A
hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or immobilized
on an appropriate substrate.

"Insertion" or "addition" refer to changes in an amino acid or nucleotide sequence
resulting in the addition of one or more amino acid residues or nucleotides, respectively, to
the sequence found in the naturally occurring molecule.

"Immune response" can refer to conditions associated with inflammation, trauma,
immune disorders, infectious or genetic disease, etc. These conditions can be characterized
by expression of various factors, e.g., cytokines, chemokines, and other signaling
molecules, which may affect cellular and systemic defense systems.

"Microarray" refers to an arrangement of distinct polynucleotides, i.e., array
elements, on a substrate.

"Modulation" refers to a change in the activity of REC. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other
biological, functional, or immunological properties of REC.

"Nucleic acid sequence" refers to an oligomer, oligonucleotide, nucleotide or
polynucleotide, and its fragments, and to DNA or RNA of genomic or synthetic origin
which may be single- or double-stranded, and represent the sense or complementary
(antisense) strand, a peptide nucleic acid (PNA), or any DNA-like or RNA-like material.
In this context, "fragments" refers to those nucleic acid sequences which are useful as
probes or to produce an amino acid sequence which displays a useful biological or
functional characteristic.

The terms "operably associated" or "operably linked" refer to functionally related
nucleic acid sequences. A promoter is operably associated or operably linked with a coding
sequence if the promoter controls the transcription of the nucleic acid sequence. While
operably associated or operably linked nucleic acid sequences can be contiguous and in the
same reading frame, certain genetic elements, e.g., repressor genes, are not contiguously
linked to the sequence encoding the polypeptide but still bind to operator sequences that
control expression of the polypeptide.

"Oligonucleotide" refers to a nucleic acid sequence of at least about 6 nucleotides to
60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 to 25
nucleotides, which can be used in PCR amplification, in hybridization, or on a microarray.
"Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer,"
"oligomer," and "probe" as commonly defined in the art.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent
which comprises an oligonucleotide of at least about 5 nucleotides in length linked to a
peptide backbone of amino acid residues ending in lysine. The terminal lysine confers
solubility to the composition, which may also be pegylated to extend persistence in the cell.
PNAs preferentially bind complementary single stranded DNA or RNA and stop
replication, transcription, translation, or transcript elongation.

A biological "sample" refers to an extract from a cell, the cell, chromosomes
isolated from a cell, genomic DNA, RNA, or cDNA in solution or bound to a substrate,
REC, protein or fragments thereof, a bodily fluid, membrane isolated from a cell, etc.

"Specifically binding" refers to that interaction between a protein or peptide and an
agonist, an antibody, or an antagonist. The interaction is dependent upon the presence of a
particular structure of the protein, e.g., the antigenic determinant or epitope, recognized by
the binding molecule. For example, if an antibody is specific for epitope "A," the presence
of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a reaction
containing free labeled A and the antibody will reduce the amount of labeled A that binds to
the antibody.

"Substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least about
60% free, preferably about 75% free, and most preferably about 90% free from other
components with which they are naturally associated.

A "substitution" refers to the replacement of one or more amino acids or nucleotides
by different amino acids or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes,
filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates,
polymers, microparticles and capillaries. The substrate can have a variety of surface forms,
such as wells, trenches, pins, channels and pores, to which the polynucleotides are bound.

"Transformation" describes a process by which exogenous DNA enters and changes
a recipient cell. Transformation may occur under natural or artificial conditions according
to various methods well known in the art, and may rely on any known method for the
insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The
method for transformation is selected based on the type of host cell being transformed and
may include, but is not limited to, viral infection, electroporation, heat shock, lipofection,
and particle bombardment. The term "transformed" cells includes stably transformed cells
in which the inserted DNA is capable of replication either as an autonomously replicating
plasmid or as part of the host chromosome, as well as transiently transformed cells which
express the inserted DNA or RNA for limited periods of time.

"Variant" refers to an amino acid sequence that is altered by one or more amino
acids. The variant may have "conservative" changes, wherein a substituted amino acid has
similar structural or chemical properties (e.g., replacement of leucine with isoleucine).
More rarely, a variant may have "nonconservative" changes (e.g., replacement of glycine
with tryptophan). Analogous minor variations may also include amino acid deletions or
insertions, or both. Guidance in determining which amino acid residues may be substituted,
inserted, or deleted without abolishing biological or immunological activity may be found
using computer programs well known in the art, for example, LASERGENE NAVIGATOR
software.
THE INVENTION 

The invention is based on the discovery of new human receptor molecules, REC, the
polynucleotides encoding REC, and the use of these compositions for the diagnosis,
treatment, or prevention of neoplastic, immunological, reproductive, gastrointestinal,
nervous, smooth muscle, and musculoskeletal disorders. Table 1 shows the protein and
nucleotide SEQ ID NOs, Incyte Clone ID, library from which the cDNA was derived, and
the overlapping and/or extended nucleic acid sequences (identified by Incyte clone number
and library) associated with the nucleic acid sequence for each of the human receptor
molecules disclosed in the Sequence Listing.

As shown in Table 2, REC has been characterized with respect to its chemical
and structural similarity. In Table 3, northern analysis shows the
expression of this sequence in various libraries, at least 33% of which are immortalized or
cancerous, at least 13% of which are in fetal tissue, and at least 13% of which involve immune
response. Of particular note is the expression of REC in reproductive, gastrointestinal,
nervous, smooth muscle, musculoskeletal, and endocrine tissues.

The invention also encompasses REC variants. A preferred REC variant is one
which has at least about 80%, more preferably at least about 90%, and most preferably at
least about 95% amino acid sequence identity to the REC amino acid sequence, and which
contains at least one functional or structural characteristic of REC.

The invention also encompasses polynucleotides which encode REC. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising the 
sequence selected from the group consisting of SEQ ID NOs:17-32, which encode REC. 

The invention also encompasses a variant of a polynucleotide sequence encoding
REC. In particular, such a variant polynucleotide sequence will have at least about 80%,
more preferably at least about 90%, and most preferably at least about 95% polynucleotide
sequence identity to the polynucleotide sequence encoding REC.

It will be appreciated by those skilled in the art that as a result of the degeneracy of
the genetic code, a multitude of polynucleotide sequences encoding REC, some bearing
minimal similarity to the polynucleotide sequences of any known and naturally occurring
gene, may be produced. Thus, the invention contemplates each and every possible variation
of polynucleotide sequence that could be made by selecting combinations based on possible
codon choices. These combinations are made in accordance with the standard triplet
genetic code as applied to the polynucleotide sequence of naturally occurring REC, and all
such variations are to be considered as being specifically disclosed.
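
To give a sense of how large the "multitude" of coding sequences can be, the following minimal sketch counts the distinct nucleotide sequences that encode a given peptide under the standard genetic code. The codon counts per amino acid are those of the standard code; the function name and the four-residue example peptide are hypothetical and do not correspond to any REC sequence.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (one-letter codes; the 20 values sum to the 61 sense codons).
CODON_COUNTS = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2,
    "H": 2, "K": 2, "F": 2, "Y": 2,
    "M": 1, "W": 1,
}


def count_encodings(peptide: str) -> int:
    """Count distinct coding sequences for a peptide (stop codon not included)."""
    return prod(CODON_COUNTS[residue] for residue in peptide.upper())


print(count_encodings("MLSW"))  # 36
```

Because the count is a product over residues, it grows roughly exponentially with peptide length, which is why the variations are described collectively rather than enumerated.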

Although nucleotide sequences which encode REC and its variants are preferably
capable of hybridizing to the nucleotide sequence of the naturally occurring REC under
appropriately selected conditions of stringency, it may be advantageous to produce
nucleotide sequences encoding REC or its derivatives possessing a substantially different
codon usage by inclusion of non-naturally occurring codons. Codons may be selected to
increase the rate at which expression of the peptide occurs in a particular prokaryotic or
eukaryotic host in accordance with the frequency with which particular codons are utilized
by the host. Other reasons for substantially altering the nucleotide sequence encoding REC
and its derivatives without altering the encoded amino acid sequences include the
production of RNA transcripts having more desirable properties, such as a greater half-life,
than transcripts produced from the naturally occurring sequence.

The invention also encompasses production of DNA sequences which encode REC
and REC derivatives, or fragments thereof, entirely by synthetic chemistry. After
production, the synthetic sequence may be inserted into any of the many available
expression vectors and cell systems using reagents well known in the art. Moreover,
synthetic chemistry may be used to introduce mutations into a sequence encoding REC or
any fragment thereof.

Also encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in
SEQ ID NOs:17-32, and fragments thereof under various conditions of stringency.
"Stringent conditions" refer to conditions which permit hybridization between
polynucleotides. Stringent conditions can be defined by salt concentration, temperature,
and other chemicals and conditions well known in the art. In particular, stringency can be
increased by reducing the concentration of salt, or raising the hybridization temperature.

For example, stringent salt concentration will ordinarily be less than about 750 mM
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM
trisodium citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium
citrate. Stringent temperature conditions will ordinarily include temperatures of at least
about 30°C, more preferably of at least about 37°C, and most preferably of at least about
42°C. Varying additional parameters, such as hybridization time, the concentration of
detergent (sodium dodecyl sulfate, SDS) or solvent (formamide), and the inclusion or
exclusion of carrier DNA, are well known to those skilled in the art. Various levels of
stringency are accomplished by combining these various conditions as needed. In a
preferred embodiment, Southern hybridization will occur at 30°C in 750 mM NaCl, 75 mM
trisodium citrate, and 1% SDS. In a more preferred embodiment, Southern hybridization
will occur at 37°C in 500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide,
and 100 µg/ml denatured salmon sperm DNA (ssDNA). In a most preferred embodiment,
Southern hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium citrate, 1%
SDS, 50% formamide, and 200 µg/ml ssDNA. Useful variations on these conditions will
be readily apparent to those skilled in the art.

The stringency of washing steps which follow hybridization can also vary as defined
by decreasing salt concentration or by increasing temperature. For example, stringent salt
concentration for the wash steps will preferably be less than about 30 mM NaCl and 3 mM
trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM trisodium
citrate. Stringent temperature conditions for the wash steps will ordinarily include
temperature of at least about 25°C, more preferably of at least about 42°C, and most
preferably of at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C
in 30 mM NaCl, 3 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment,
wash steps will occur at 42°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS.
In a most preferred embodiment, wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM
trisodium citrate, and 0.1% SDS. Additional variations on these conditions will be readily
apparent to those skilled in the art (Wahl, G.M. and S.L. Berger (1987) Methods Enzymol.
152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511; Ausubel, F.M. et al.
(1997) Short Protocols in Molecular Biology, John Wiley & Sons, New York NY; and
Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor
Press, Plainview NY).
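
The salt concentrations recited for the hybridization and wash steps keep NaCl and trisodium citrate in a fixed 10:1 ratio, so each can be restated as a multiple of standard saline citrate (SSC). The sketch below does that conversion. It assumes the conventional definition of 1x SSC as 150 mM NaCl and 15 mM trisodium citrate, which the text itself does not state; the function and constant names are hypothetical.

```python
# Assumed convention: 1x SSC = 150 mM NaCl + 15 mM trisodium citrate.
SSC_1X_NACL_MM = 150.0
SSC_1X_CITRATE_MM = 15.0

# (step, temperature in deg C, NaCl in mM, trisodium citrate in mM), from the text.
CONDITIONS = [
    ("hybridization, preferred", 30, 750.0, 75.0),
    ("hybridization, more preferred", 37, 500.0, 50.0),
    ("hybridization, most preferred", 42, 250.0, 25.0),
    ("wash, preferred", 25, 30.0, 3.0),
    ("wash, more preferred", 42, 15.0, 1.5),
    ("wash, most preferred", 68, 15.0, 1.5),
]


def ssc_multiple(nacl_mm: float, citrate_mm: float) -> float:
    """Express a NaCl/citrate pair as a multiple of 1x SSC."""
    nacl_x = nacl_mm / SSC_1X_NACL_MM
    citrate_x = citrate_mm / SSC_1X_CITRATE_MM
    # Both components must imply the same dilution for the buffer to be SSC.
    if abs(nacl_x - citrate_x) > 1e-9:
        raise ValueError("salt pair is not a simple dilution of SSC")
    return nacl_x


for step, temp_c, nacl, citrate in CONDITIONS:
    print(f"{step}: {temp_c} C, {ssc_multiple(nacl, citrate):.2f}x SSC")
```

Read this way, stringency rises along each series as the SSC multiple falls and the temperature rises, consistent with the statement that stringency is increased by reducing salt or raising temperature.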

Methods for DNA sequencing are well known and generally available in the art and
may be used to practice any of the embodiments of the invention. The methods may
employ such enzymes as the Klenow fragment of DNA polymerase I, SEQUENASE
(Amersham Pharmacia Biotech, Piscataway NJ), Taq polymerase (Perkin Elmer),
thermostable T7 polymerase (Amersham Pharmacia Biotech), or combinations of
polymerases and proofreading exonucleases such as those found in the ELONGASE
Amplification System (Life Technologies, Gaithersburg MD). Preferably, sequence
preparation is automated with machines such as the ABI Catalyst 800 (Perkin Elmer) or
a Hamilton Micro Lab 2200 (Hamilton, Reno NV) in combination with thermal cyclers (for
PCR). Sequencing is then carried out using either DNA sequencers (Perkin Elmer) or
capillary electrophoresis (Molecular Dynamics).

The nucleotide and/or amino acid sequences of the Sequence Listing can be used to
query sequences in the GenBank primate (pri), rodent (rod), and mammalian (mam),
vertebrate (vrtp), and eukaryote (eukp) databases, SwissProt, BLOCKS (Bairoch, A. et al.
(1997) Nucleic Acids Res. 25:217-221), PFAM, and other databases which contain
previously identified and annotated motifs and sequences. Methods such as those which
deal with primary sequence patterns and secondary structure gap penalties (Smith, T. et al.
(1992) Protein Engineering 5:35-51) and programs and algorithms such as BLAST (Basic
Local Alignment Search Tool; Altschul, S.F. (1993) J. Mol. Evol. 36:290-300; and Altschul
et al. (1990) J. Mol. Biol. 215:403-410), BLOCKS (Henikoff, S. and Henikoff, G.J. (1991)
Nucleic Acids Research 19:6565-6572), Hidden Markov Models (HMM; Eddy, S.R. (1996;
Cur. Opin. Str. Biol. 6:361-365) and Sonnhammer, E.L.L. et al. (1997; Proteins
28:405-420)), etc. can be used to manipulate and analyze nucleotide and amino acid
sequences. These databases, programs, algorithms and other methods and tools are well
known in the art and are described in Ausubel (supra, unit 7.7) and in Meyers, R.A. (1995;
Molecular Biology and Biotechnology, Wiley VCH, Inc, New York NY, p 856-853).

The nucleic acid sequences encoding REC may be extended utilizing a partial
nucleotide sequence and employing various PCR-based methods known in the art to detect
upstream sequences, such as promoters and regulatory elements. (See, e.g., Dieffenbach,
C.W. and G.S. Dveksler (1995; PCR Primer, a Laboratory Manual, Cold Spring Harbor
Press, Plainview, NY, pp.1-5); Sarkar, G. (1993; PCR Methods Applic. 2:318-322); Triglia,
T. et al. (1988; Nucleic Acids Res. 16:8186); Lagerstrom, M. et al. (1991; PCR Methods
Applic. 1:111-119); and Parker, J.D. et al. (1991; Nucleic Acids Res. 19:3055-306).)
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries to
walk genomic DNA (Clontech, Palo Alto, CA). This procedure avoids the need to screen
libraries and is useful in finding intron/exon junctions. For all PCR-based methods, primers
may be designed using commercially available software, such as OLIGO 4.06 Primer
Analysis software (National Biosciences Inc., Plymouth MN) or another appropriate
program, to be about 22 to 30 nucleotides in length, to have a GC content of about 50% or
more, and to anneal to the template at temperatures of about 68°C to 72°C.
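
The length and GC-content criteria stated above can be checked with a minimal sketch such as the following. It is not the OLIGO 4.06 software and it does not estimate annealing temperature, which depends on a melting-temperature model the text does not specify. The function names and the candidate primer are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a non-empty primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def meets_stated_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check the length (about 22 to 30 nt) and GC (about 50% or more) criteria."""
    # The length test runs first, so an empty string never reaches gc_fraction.
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


candidate = "GCGATCGGCTAGCCATGGCTAGCA"  # hypothetical 24-mer, illustration only
print(len(candidate), gc_fraction(candidate), meets_stated_criteria(candidate))
```

The thresholds are exposed as parameters because the text gives them as approximate ("about") values rather than hard limits.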

When screening for full-length cDNAs, it is preferable to use libraries that have
been size-selected to include larger cDNAs. In addition, random-primed libraries, which
often include sequences containing the 5' regions of genes, are preferable for situations in
which an oligo d(T) library does not yield a full-length cDNA. Genomic libraries may be
useful for extension of sequence into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In
particular, capillary sequencing may employ flowable polymers for electrophoretic
separation, four different nucleotide-specific, laser-stimulated fluorescent dyes, and a
charge coupled device camera for detection of the emitted wavelengths. Output/light
intensity may be converted to electrical signal using appropriate software (e.g.,
GENOTYPER and SEQUENCE NAVIGATOR, Perkin Elmer), and the entire process from
loading of samples to computer analysis and electronic data display may be computer
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA
fragments which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments
thereof which encode REC may be cloned in recombinant DNA molecules that direct
expression of REC, or fragments or functional equivalents thereof, in appropriate host cells.
Due to the inherent degeneracy of the genetic code, other DNA sequences which encode
substantially the same or a functionally equivalent amino acid sequence may be produced
and used to express REC.

The nucleotide sequences of the present invention can be engineered using methods
generally known in the art in order to alter REC-encoding sequences for a variety of
purposes including, but not limited to, modification of the cloning, processing, and/or
expression of the gene product. DNA shuffling by random fragmentation and PCR
reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the
nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis
may be used to introduce mutations that create new restriction sites, alter glycosylation
patterns, change codon preference, produce splice variants, and so forth.

In another embodiment, sequences encoding REC may be synthesized, in whole or
in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al.
(1980) Nucl. Acids Res. Symp. Ser. 215-223; Horn, T. et al. (1980) Nucl. Acids Res. Symp.
Ser. 225-232; and Ausubel, supra.) Alternatively, REC itself or a fragment thereof may be
synthesized using chemical methods. For example, peptide synthesis can be performed
using various solid-phase techniques (Roberge, J.Y. et al. (1995) Science 269:202-204).
Automated synthesis may be achieved using the ABI 431A Peptide Synthesizer (Perkin
Elmer). Additionally, the amino acid sequence of REC, or any part thereof, may be altered
during direct synthesis and/or combined with sequences from other proteins, or any part
thereof, to produce a variant polypeptide.

The peptide may be substantially purified by preparative high performance liquid
chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421).
The composition of the synthetic peptides may be confirmed by amino acid analysis or by
sequencing (Ausubel, supra).

In order to express a biologically active REC, the nucleotide sequences encoding
REC or derivatives thereof may be inserted into an appropriate expression vector, i.e., a
vector which contains the necessary elements for transcriptional and translational control of
the inserted coding sequence in a suitable host. These elements include regulatory
sequences, such as enhancers, constitutive and inducible promoters, and 5' and 3'
untranslated regions in the vector and in polynucleotide sequences encoding REC. Such
elements may vary in their strength and specificity. Specific initiation signals may also be
used to achieve more efficient translation of sequences encoding REC. Such signals
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In
cases where sequences encoding REC and its initiation codon and upstream regulatory
sequences are inserted into the appropriate expression vector, no additional transcriptional
or translational control signals may be needed. However, in cases where only coding
sequence, or a fragment thereof, is inserted, exogenous translational control signals
including an in-frame ATG initiation codon should be provided by the vector. Exogenous
translational elements and initiation codons may be of various origins, both natural and
synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers
appropriate for the particular host cell system used (Scharf, D. et al. (1994) Results Probl.
Cell Differ. 20:125-162).

Methods which are well known to those skilled in the art may be used to construct
expression vectors containing sequences encoding REC and appropriate transcriptional and
translational control elements. These methods include in vitro recombinant DNA
techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook
(supra) and Ausubel (supra).)

A variety of expression vector/host systems may be utilized to contain and express
sequences encoding REC. These include, but are not limited to, microorganisms such as
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with
viral expression vectors (baculovirus); plant cell systems transformed with viral expression
vectors, cauliflower mosaic virus (CaMV) or tobacco mosaic virus (TMV), or with bacterial
expression vectors (Ti or pBR322 plasmids); or animal cell systems. The invention is not
limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected
depending upon the use intended for polynucleotide sequences encoding REC. For
example, routine cloning, subcloning, and propagation of polynucleotide sequences
encoding REC can be achieved using a multifunctional E. coli vector such as
BLUESCRIPT (Stratagene, La Jolla CA) or pSPORT1 plasmid (Life Technologies).
Ligation of sequences encoding REC into the vector's multiple cloning site disrupts the
lacZ gene, allowing a colorimetric screening procedure for identification of transformed
bacteria containing recombinant molecules. In addition, these vectors may be useful for in
vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation
of nested deletions in the cloned sequence (Van Heeke, G. and S.M. Schuster (1989) J.
Biol. Chem. 264:5503-5509). When large quantities of REC are needed, e.g. for the
production of antibodies, vectors which direct high level expression of REC may be used.
For example, vectors containing the strong, inducible T5 or T7 bacteriophage promoter may
be used.

Yeast expression systems may be used for production of REC. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and
PGH, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition,
such vectors direct either the secretion or intracellular retention of expressed proteins and
enable integration of foreign sequences into the host genome for stable propagation
(Ausubel, supra; Scorer, C.A. et al. (1994) Bio/Technology 12:181-184).

Plant systems may also be used for expression of REC. Transcription of sequences
encoding REC may be driven by viral promoters such as the 35S and 19S promoters of
CaMV used alone or in combination with the omega leader sequence from TMV
(Takamatsu, N. (1987) EMBO J. 6:307-311). Alternatively, plant promoters such as the
small subunit of RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984)
EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al.
(1991) Results Probl. Cell Differ. 17:85-105). These constructs can be introduced into
plant cells by direct DNA transformation or pathogen-mediated transfection. (See McGraw
Hill Yearbook of Science and Technology, McGraw Hill, New York NY, pp.
191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In
cases where an adenovirus is used as an expression vector, sequences encoding REC may
be ligated into an adenovirus transcription/translation complex consisting of the late
promoter and tripartite leader sequence. Insertion in a non-essential E1 or E3 region of the
viral genome may be used to obtain infective virus which expresses REC in host cells
(Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition,
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to
increase expression in mammalian host cells. SV40 or EBV-based vectors may also be
used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger
fragments of DNA than can be contained in and expressed from a plasmid. HACs of about
6 kb to 10 Mb are constructed and delivered via conventional delivery methods (liposomes,
polycationic amino polymers, or vesicles) for therapeutic purposes.

For long term production of recombinant proteins in mammalian systems, stable
expression of REC in cell lines is preferred. For example, sequences encoding REC can be
transformed into cell lines using expression vectors which may contain viral origins of
replication and/or endogenous expression elements and a selectable marker gene on the
same or on a separate vector. Following the introduction of the vector, cells may be
allowed to grow for about 1 to 2 days in enriched media before being switched to selective
media. The purpose of the selectable marker is to confer resistance to a selective agent, and
its presence allows growth and recovery of cells which successfully express the introduced
sequences. Resistant clones of stably transformed cells may be propagated using tissue
culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines.
These include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk⁻ or apr⁻ cells, respectively (Wigler, M. et al.
(1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite,
antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr
confers resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin
and G-418; and als or pat confer resistance to chlorsulfuron and phosphinotricin
acetyltransferase, respectively. Additional selectable genes have been described, e.g., trpB
and hisD, which alter cellular requirements for metabolites (Hartman, S.C. and R.C.
Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051). Visible markers, e.g., anthocyanins,
green fluorescent proteins (GFP; Clontech, Palo Alto, CA), β glucuronidase and its
substrate β-D-glucuronide, or luciferase and its substrate luciferin may be used. These
markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A.
et al. (1995) Methods Mol. Biol. 55:121-131).

Although the presence/absence of marker gene expression suggests that the gene of
interest is also present, the presence and expression of the gene may need to be confirmed.
For example, if the sequence encoding REC is inserted within a marker gene sequence,
transformed cells containing sequences encoding REC can be identified by the absence of
marker gene function. Alternatively, a marker gene can be placed in tandem with a
sequence encoding REC under the control of a single promoter. Expression of the marker
gene in response to induction or selection usually indicates expression of the tandem gene
as well.

In general, host cells that contain the nucleic acid sequence encoding REC and that
express REC may be identified by a variety of procedures known to those of skill in the art.
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations,
PCR amplification, and protein bioassay or immunoassay techniques which include
membrane, solution, or chip based technologies for the detection and/or quantification of
nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of REC using
either specific polyclonal or monoclonal antibodies are known in the art. Examples of such
techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays
(RIAs), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on
REC is preferred, but a competitive binding assay may be employed. These and other
assays are well known in the art. (See, e.g., Hampton, R. et al. (1990) Serological Methods,
a Laboratory Manual, APS Press, St Paul, MN, Section IV; Coligan, J.E. et al. (1997 and
periodic supplements) Current Protocols in Immunology, Greene Pub. Associates and
Wiley-Interscience, New York, NY; and Maddox, D.E. et al. (1983) J. Exp. Med.
158:1211-1216).

A wide variety of labels and conjugation techniques are known by those skilled in
the art and may be used in various nucleic acid and amino acid assays. Means for
producing labeled hybridization or PCR probes for detecting sequences related to
polynucleotides encoding REC include oligolabeling, nick translation, end-labeling, or PCR
amplification using a labeled nucleotide. Alternatively, the sequences encoding REC, or
any fragments thereof, may be cloned into a vector for the production of an mRNA probe.
Such vectors are known in the art, are commercially available, and may be used to
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7,
T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of
commercially available kits, such as those provided by Pharmacia & Upjohn (Kalamazoo,
MI), Promega (Madison, WI), and U.S. Biochemical Corp. (Cleveland, OH). Suitable
reporter molecules or labels which may be used for ease of detection include radionuclides,
enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates,
cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding REC may be cultured
under conditions suitable for the expression and recovery of the protein from cell culture.
The protein produced by a transformed cell may be secreted or retained intracellularly
depending on the sequence and/or the vector used. As will be understood by those of skill
in the art, expression vectors containing polynucleotides which encode REC may be
designed to contain signal sequences which direct secretion of REC through a prokaryotic
or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of
the inserted sequences or to process the expressed protein in the desired fashion. Such
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation,
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing
which cleaves a "prepro" form of the protein may also be used to specify protein targeting,
folding, and/or activity. Different host cells which have specific cellular machinery and
characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK,
HEK293, and WI38) are available from the American Type Culture Collection (ATCC,
Bethesda, MD) and may be chosen to ensure the correct modification and processing of the
foreign protein.

WO 99/57270    PCT/US99/09191

In another embodiment of the invention, natural, modified, or recombinant nucleic
acid sequences encoding REC may be ligated to a heterologous sequence resulting in
translation of a fusion protein in any of the aforementioned host systems. For example, a
chimeric REC protein containing a heterologous moiety that can be recognized by a
commercially available antibody may facilitate the screening of peptide libraries for
inhibitors of REC activity. Heterologous protein and peptide moieties may also facilitate
purification of fusion proteins using commercially available affinity matrices. Such
moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding
protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc,
and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide,
calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA)
enable immunoaffinity purification of fusion proteins using commercially available
monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A
fusion protein may also be engineered to contain a proteolytic cleavage site located between
the REC encoding sequence and the heterologous protein sequence, so that REC may be
cleaved away from the heterologous moiety following purification. Methods for fusion
protein expression and purification are discussed in Ausubel (supra). A variety of
commercially available kits may also be used to facilitate expression and purification of
fusion proteins.
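
The pairing of fusion moieties with purification matrices described above can be written as a small lookup table. The following Python sketch is illustrative only; the names AFFINITY_MATRIX, EPITOPE_TAGS, and purification_route are hypothetical and do not appear in the specification.

```python
# Fusion moieties and the immobilized ligands named in the text above.
AFFINITY_MATRIX = {
    "GST": "glutathione",
    "MBP": "maltose",
    "Trx": "phenylarsine oxide",
    "CBP": "calmodulin",
    "6-His": "metal-chelate",
}

# Epitope tags purified by immunoaffinity with tag-specific antibodies.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag: str) -> str:
    """Describe how a fusion protein carrying `tag` would be purified."""
    if tag in AFFINITY_MATRIX:
        return f"affinity purification on immobilized {AFFINITY_MATRIX[tag]} resin"
    if tag in EPITOPE_TAGS:
        return f"immunoaffinity purification with an antibody recognizing {tag}"
    raise ValueError(f"unknown fusion moiety: {tag}")
```

For example, purification_route("GST") reports purification on immobilized glutathione resin, while purification_route("FLAG") reports an immunoaffinity route.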

In a further embodiment of the invention, synthesis of radiolabeled REC may be
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems
(Promega). These systems couple transcription and translation of protein-coding sequences
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the
presence of a radiolabeled amino acid precursor, preferably ³⁵S-methionine.

Fragments of REC may be produced not only by recombinant production, but also
by direct peptide synthesis using solid-phase techniques (Creighton, supra pp. 55-60).
Protein synthesis may be performed by manual techniques or by automation. Automated
synthesis may be achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin
Elmer). Various fragments of REC may be synthesized separately and then combined to
produce the full length molecule.

THERAPEUTICS

Chemical and structural similarity exists among REC and receptor molecules. In
addition, REC is expressed in cancer, immunological, fetal, reproductive, gastrointestinal,
nervous, smooth muscle, endocrine, and musculoskeletal tissues. Therefore, REC appears
to play a role in neoplastic, immunological, reproductive, gastrointestinal, nervous, smooth
muscle, and musculoskeletal disorders.

Therefore, in one embodiment, an antagonist of REC may be administered to a
subject to treat or prevent a neoplastic disorder. Such a neoplastic disorder may include,
but is not limited to, adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma,
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone
marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney,
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin,
spleen, testis, thymus, thyroid, and uterus. In one aspect, an antibody which specifically
binds REC may be used directly as an antagonist or indirectly as a targeting or delivery
mechanism for bringing a pharmaceutical agent to cells or tissue which express REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a
neoplastic disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent an immunological disorder. Such an immunological disorder may include,
but is not limited to, acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia,
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis,
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins,
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis,
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis,
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis,
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjögren's
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of
cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic,
protozoal, and helminthic infections, and trauma. In one aspect, an antibody which
specifically binds REC may be used directly as an antagonist or indirectly as a targeting or
delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express
REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent an
immunological disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent a reproductive disorder. Such a reproductive disorder may include, but is
not limited to, disorders of prolactin production; infertility, including tubal disease,
ovulatory defects, and endometriosis; disruptions of the estrous cycle, disruptions of the
menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome,
endometrial and ovarian tumors, uterine fibroids, autoimmune disorders, ectopic
pregnancies, and teratogenesis; cancer of the breast, fibrocystic breast disease, and
galactorrhea; disruptions of spermatogenesis, abnormal sperm physiology, cancer of the
testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease,
carcinoma of the male breast, and gynecomastia. In one aspect, an antibody which
specifically binds REC may be used directly as an antagonist or indirectly as a targeting or
delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express
REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a
reproductive disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent a gastrointestinal disorder. Such a gastrointestinal disorder may include,
but is not limited to, dysphagia, peptic esophagitis, esophageal spasm, esophageal stricture,
esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea,
emesis, gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis,
intestinal obstruction, infections of the intestinal tract, peptic ulcer, cholelithiasis,
cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, hepatoma,
infectious colitis, ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease,
Mallory-Weiss syndrome, colonic carcinoma, colonic obstruction, irritable bowel
syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal hemorrhage, and
acquired immunodeficiency syndrome (AIDS) enteropathy, cirrhosis, jaundice, cholestasis,
hereditary hyperbilirubinemia, hepatic encephalopathy, hepatorenal syndrome, hepatitis,
hepatic steatosis, hemochromatosis, Wilson's disease, α1-antitrypsin deficiency, Reye's
syndrome, primary sclerosing cholangitis, liver infarction, portal vein obstruction and
thrombosis, passive congestion, centrilobular necrosis, peliosis hepatis, hepatic vein
thrombosis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of
pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including nodular
hyperplasias, adenomas, and carcinomas. In one aspect, an antibody which specifically
binds REC may be used directly as an antagonist or indirectly as a targeting or delivery
mechanism for bringing a pharmaceutical agent to cells or tissue which express REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a
gastrointestinal disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent a nervous disorder. Such a nervous disorder may include, but is not limited
to, akathisia, Alzheimer's disease, amnesia, amyotrophic lateral sclerosis, bipolar disorder,
catatonia, cerebral neoplasms, dementia, depression, diabetic neuropathy, Down's
syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's disease, peripheral
neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses,
postherpetic neuralgia, schizophrenia, and Tourette's disorder. In one aspect, an antibody
which specifically binds REC may be used directly as an antagonist or indirectly as a
targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which
express REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a
nervous disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent a smooth muscle disorder. A smooth muscle disorder is defined as any
impairment or alteration in the normal action of smooth muscle and may include, but is not
limited to, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock,
Cushing's syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, and
pheochromocytoma, and myopathies including cardiomyopathy, encephalopathy, epilepsy,
Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, and ophthalmoplegia. Smooth
muscle includes, but is not limited to, that of the blood vessels, gastrointestinal tract, heart,
and uterus. In one aspect, an antibody which specifically binds REC may be used directly
as an antagonist or indirectly as a targeting or delivery mechanism for bringing a
pharmaceutical agent to cells or tissue which express REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a smooth
muscle disorder including, but not limited to, those described above.

In a further embodiment, an antagonist of REC may be administered to a subject to
treat or prevent a musculoskeletal disorder. Such a musculoskeletal disorder may include,
but is not limited to, Duchenne's muscular dystrophy, Becker's muscular dystrophy,
myotonic dystrophy, central core disease, nemaline myopathy, centronuclear myopathy,
lipid myopathy, mitochondrial myopathy, infectious myositis, polymyositis,
dermatomyositis, inclusion body myositis, thyrotoxic myopathy, and ethanol myopathy. In
one aspect, an antibody which specifically binds REC may be used directly as an antagonist
or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to
cells or tissue which express REC.

In an additional embodiment, a vector expressing the complement of the
polynucleotide encoding REC may be administered to a subject to treat or prevent a
musculoskeletal disorder including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists,
complementary sequences, or vectors of the invention may be administered in combination
with other appropriate therapeutic agents. Selection of the appropriate agents for use in
combination therapy may be made by one of ordinary skill in the art, according to
conventional pharmaceutical principles. The combination of therapeutic agents may act
synergistically to effect the treatment or prevention of the various disorders described
above. Using this approach, one may be able to achieve therapeutic efficacy with lower
dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of REC may be produced using methods which are generally known
in the art. In particular, purified REC may be used to produce antibodies or to screen
libraries of pharmaceutical agents to identify those which specifically bind REC.
Antibodies to REC may also be generated using methods that are well known in the art.
Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and
single chain antibodies, Fab fragments, and fragments produced by a Fab expression
library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially
preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice,
humans, and others may be immunized by injection with REC or with any fragment or
oligopeptide thereof which has immunogenic properties. Depending on the host species,
various adjuvants may be used to increase immunological response. Such adjuvants
include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and
surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil
emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli
Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce
antibodies to REC have an amino acid sequence consisting of at least about 5 amino acids,
and, more preferably, of at least about 10 amino acids. It is also preferable that these
oligopeptides, peptides, or fragments are identical to a portion of the amino acid sequence
of the natural protein and contain the entire amino acid sequence of a small, naturally
occurring molecule. Short stretches of REC amino acids may be fused with those of
another protein, such as KLH, and antibodies to the chimeric molecule may be produced.
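
As an illustration of the length criteria above, candidate peptides identical to a portion of the natural sequence can be enumerated with a sliding window. This is a minimal Python sketch; the function name peptide_windows and the example sequence are hypothetical, and the sketch makes no attempt to predict which peptides are actually immunogenic.

```python
def peptide_windows(protein: str, length: int = 10) -> list[str]:
    """List every contiguous peptide of `length` residues in `protein`.

    The text prefers at least about 5 residues, and more preferably at
    least about 10, so shorter windows are rejected here.
    """
    if length < 5:
        raise ValueError("peptides of at least about 5 amino acids are preferred")
    # Each window is identical to a portion of the input sequence.
    return [protein[i:i + length] for i in range(len(protein) - length + 1)]
```

With the made-up sequence "ACDEFGH" and length 5, the function returns the three peptides ACDEF, CDEFG, and DEFGH.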

Monoclonal antibodies to REC may be prepared using any technique which
provides for the production of antibody molecules by continuous cell lines in culture.
These include, but are not limited to, the hybridoma technique, the human B-cell hybridoma
technique, and the EBV-hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature
256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al.
(1983) Proc. Natl. Acad. Sci. 80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol.
62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such
as the splicing of mouse antibody genes to human antibody genes to obtain a molecule with
appropriate antigen specificity and biological activity, can be used. (See, e.g., Morrison,
S.L. et al. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. (1984) Nature
312:604-608; and Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques
described for the production of single chain antibodies may be adapted, using methods
known in the art, to produce REC-specific single chain antibodies. Antibodies with related
specificity, but of distinct idiotypic composition, may be generated by chain shuffling from
random combinatorial immunoglobulin libraries (Burton D.R. (1991) Proc. Natl. Acad. Sci.
88:10134-10137).

Antibodies may also be produced by inducing in vivo production in the lymphocyte
population or by screening immunoglobulin libraries or panels of highly specific binding
reagents as disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci.
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299).

Antibody fragments which contain specific binding sites for REC may also be
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments
produced by pepsin digestion of the antibody molecule and Fab fragments generated by
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression
libraries may be constructed to allow rapid and easy identification of monoclonal Fab
fragments with the desired specificity (Huse, W.D. et al. (1989) Science 246:1275-1281).

Various immunoassays may be used for screening to identify antibodies having the
desired specificity. Numerous protocols for competitive binding or immunoradiometric
assays using either polyclonal or monoclonal antibodies with established specificities are
well known in the art. Such immunoassays typically involve the measurement of complex
formation between REC and its specific antibody. A two-site, monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two non-interfering REC epitopes
is preferred, but a competitive binding assay may also be employed (Maddox, supra).

In another embodiment of the invention, the polynucleotides encoding REC, or any
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the
complement of the polynucleotide encoding REC may be used in situations in which it
would be desirable to block the transcription of the mRNA. In particular, cells may be
transformed with sequences complementary to polynucleotides encoding REC. Thus,
complementary molecules or fragments may be used to modulate REC activity, or to
achieve regulation of gene function. Such technology is now well known in the art, and
sense or antisense oligonucleotides or larger fragments can be designed from various
locations along the coding or control regions of sequences encoding REC.

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia
viruses, or from various bacterial plasmids, may be used for delivery of nucleotide
sequences to the targeted organ, tissue, or cell population. Methods which are well known
to those skilled in the art can be used to construct vectors to express nucleic acid sequences
complementary to the polynucleotides encoding REC (Sambrook, supra; Ausubel, supra).

Genes encoding REC can be turned off by transforming a cell or tissue with
expression vectors which express high levels of a polynucleotide, or fragment thereof,
encoding REC. Such constructs may be used to introduce untranslatable sense or antisense
sequences into a cell. Even in the absence of integration into the DNA, such vectors may
continue to transcribe RNA molecules until they are disabled by endogenous nucleases.
Transient expression may last for a month or more with a non-replicating vector, and may
last even longer if appropriate replication elements are part of the vector system.

As mentioned above, modifications of gene expression can be obtained by designing
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5',
or regulatory regions of the gene encoding REC. Oligonucleotides derived from the
transcription initiation site, e.g., between about positions -10 and +10 from the start site, are
preferred. Similarly, inhibition can be achieved using triple helix base-pairing
methodology. Triple helix pairing is useful because it causes inhibition of the ability of the
double helix to open sufficiently for the binding of polymerases, transcription factors, or
regulatory molecules. Recent therapeutic advances using triplex DNA have been described
in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and
Immunologic Approaches, Futura Publishing Co., Mt. Kisco NY, pp. 163-177). A
complementary sequence or antisense molecule may also be designed to block translation
of mRNA by preventing the transcript from binding to ribosomes.
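
The design of an antisense oligonucleotide spanning about positions -10 to +10 of the transcription initiation site can be sketched as follows. This Python example is a hedged illustration, not a method taken from the specification: the function names, the 0-based start_index parameter, and the convention that position -1 directly precedes +1 (no position 0) are assumptions made for the example.

```python
# Complement table for an unambiguous DNA alphabet.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def antisense_oligo(sense: str, start_index: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligonucleotide covering -upstream..+downstream of the start site.

    `sense` is the sense-strand DNA and `start_index` is the 0-based index of
    the +1 base. With no position 0, the default window is 20 bases long.
    """
    lo = start_index - upstream
    hi = start_index + downstream
    if lo < 0 or hi > len(sense):
        raise ValueError("window extends beyond the supplied sequence")
    return reverse_complement(sense[lo:hi])
```

The returned oligonucleotide is complementary to the sense strand across the window, so it would be expected to hybridize to that region of the sense strand or its transcript; secondary structure and off-target matches are not considered here.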

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific
cleavage of RNA. The mechanism of ribozyme action involves sequence-specific
hybridization of the ribozyme molecule to complementary target RNA, followed by
endonucleolytic cleavage. For example, engineered hammerhead motif ribozyme
molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences
encoding REC.

Specific ribozyme cleavage sites within any potential RNA target are initially
identified by scanning the target molecule for ribozyme cleavage sites, including the
following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of
between 15 and 20 ribonucleotides, corresponding to the region of the target gene
containing the cleavage site, may be evaluated for secondary structural features which may
render the oligonucleotide inoperable. The suitability of candidate targets may also be
evaluated by testing accessibility to hybridization with complementary oligonucleotides
using ribonuclease protection assays.
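
The initial scan described above can be sketched in Python. This is an illustrative example under stated assumptions: the function names, the default window length of 17, and the choice to place the triplet near the center of each window are hypothetical, and the later evaluation of secondary structure and accessibility is not modeled.

```python
import re


def find_cleavage_sites(rna: str) -> list[int]:
    """Return 0-based positions of every GUA, GUU, or GUC triplet in `rna`."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    # A lookahead is used so that each position is tested independently.
    return [m.start() for m in re.finditer(r"(?=GU[AUC])", rna)]


def candidate_windows(rna: str, length: int = 17) -> list[tuple[int, str]]:
    """Pair each cleavage site with a window of 15 to 20 nucleotides around it.

    Sites too close to either end of the sequence to fit a full window
    are skipped.
    """
    if not 15 <= length <= 20:
        raise ValueError("window length should be between 15 and 20")
    rna = rna.upper().replace("T", "U")
    windows = []
    for site in find_cleavage_sites(rna):
        start = site + 1 - length // 2  # put the triplet's middle base near the center
        end = start + length
        if start >= 0 and end <= len(rna):
            windows.append((site, rna[start:end]))
    return windows
```

Each returned window is a candidate region that would still need to be checked for secondary structure and tested for accessibility, for example by ribonuclease protection assays as the text describes.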

Complementary ribonucleic acid molecules and ribozymes of the invention may be
prepared by any method known in the art for the synthesis of nucleic acid molecules. These
include techniques for chemically synthesizing oligonucleotides such as solid phase
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by
in vitro and in vivo transcription of DNA sequences encoding REC. Such DNA sequences
may be incorporated into a wide variety of vectors with suitable RNA polymerase
promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or
tissues.

RNA molecules may be modified to increase intracellular stability and half-life.
Possible modifications include, but are not limited to, the addition of flanking sequences at
the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather
than phosphodiester linkages within the backbone of the molecule. This concept is
inherent in the production of PNAs and can be extended in all of these molecules by the
inclusion of nontraditional bases such as inosine, queuosine, and wybutosine, as well as
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine,
and uridine which are not as easily recognized by endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally
suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be
introduced into stem cells taken from the patient and clonally propagated for autologous
transplant back into that same patient. Delivery by transfection, by liposome injections, or
by polycationic amino polymers may be achieved using methods which are well known in
the art. (See, e.g., Goldman, C.K. et al. (1997) Nature Biotechnology 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in
need of such therapy, including, for example, mammals such as dogs, cats, cows, horses,
rabbits, monkeys, and most preferably, humans.

An additional embodiment of the invention relates to the administration of a
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable
carrier, for any of the therapeutic effects discussed above. Such pharmaceutical
compositions may consist of REC, antibodies to REC, and mimetics, agonists, antagonists,
or inhibitors of REC. The compositions may be administered alone or in combination with
at least one other agent, such as a stabilizing compound, which may be administered in any
sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered
saline, dextrose, and water. The compositions may be administered to a patient alone, or in
combination with other agents, drugs, or hormones.

The pharmaceutical compositions utilized in this invention may be administered by
any number of routes including, but not limited to, oral, intravenous, intramuscular,
intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous,
intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.

In addition to the active ingredients, these pharmaceutical compositions may contain
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which
facilitate processing of the active compounds into preparations which can be used
pharmaceutically. Further details on techniques for formulation and administration may be
found in the latest edition of Remington's Pharmaceutical Sciences (Mack Publishing Co.,
Easton, PA).

Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral
administration. Such carriers enable the pharmaceutical compositions to be formulated as
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for
ingestion by the patient.

Pharmaceutical preparations for oral use can be obtained through combining active
compounds with solid excipient and processing the resultant mixture of granules
(optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be
added, if desired. Suitable excipients include carbohydrate or protein fillers, such as sugars,
including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, potato, or
other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium
carboxymethylcellulose; gums, including arabic and tragacanth; and proteins, such as
gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such as
the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, such as
sodium alginate.

Dragee cores may be used in conjunction with suitable coatings, such as
concentrated sugar solutions, which may also contain gum arabic, talc,
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be
added to the tablets or dragee coatings for product identification or to characterize the
quantity of active compound, i.e., dosage.

Pharmaceutical preparations which can be used orally include push-fit capsules
made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or
binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and,
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or
suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or
without stabilizers.

Pharmaceutical formulations suitable for parenteral administration may be
formulated in aqueous solutions, preferably in physiologically compatible buffers such as
Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous injection
suspensions may contain substances which increase the viscosity of the suspension, such as
sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the
active compounds may be prepared as appropriate oily injection suspensions. Suitable
lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid
esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino
polymers may also be used for delivery. Optionally, the suspension may also contain
suitable stabilizers or agents to increase the solubility of the compounds and allow for the
preparation of highly concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular barrier to
be permeated are used in the formulation. Such penetrants are generally known in the art.

The pharmaceutical compositions of the present invention may be manufactured in a
manner that is known in the art, e.g., by means of conventional mixing, dissolving,
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or
lyophilizing processes.

The pharmaceutical composition may be provided as a salt and can be formed with
many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric,
malic, and succinic acid. Salts tend to be more soluble in aqueous or other protonic
solvents than are the corresponding free base forms. In other cases, the preferred
preparation may be a lyophilized powder which may contain any or all of the following: 1
mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of 4.5
to 5.5, that is combined with buffer prior to use.

After pharmaceutical compositions have been prepared, they can be placed in an
appropriate container and labeled for treatment of an indicated condition. For
administration of REC, such labeling would include amount, frequency, and method of
administration.

Pharmaceutical compositions suitable for use in the invention include compositions
wherein the active ingredients are contained in an effective amount to achieve the intended
purpose. The determination of an effective dose is well within the capability of those
skilled in the art.

For any compound, the therapeutically effective dose can be estimated initially
either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats,
rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate
concentration range and route of administration. Such information can then be used to
determine useful doses and routes for administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for
example REC or fragments thereof, antibodies of REC, and agonists, antagonists or
inhibitors of REC, which ameliorates the symptoms or condition. Therapeutic efficacy and
toxicity may be determined by standard pharmaceutical procedures in cell cultures or with
experimental animals, such as by calculating the ED50 (the dose therapeutically effective in
50% of the population) or LD50 (the dose lethal to 50% of the population) statistics. The
dose ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as
the LD50/ED50 ratio. Pharmaceutical compositions which exhibit large therapeutic indices
are preferred. The data obtained from cell culture assays and animal studies are used to
formulate a range of dosage for human use. The dosage contained in such compositions is
preferably within a range of circulating concentrations that includes the ED50 with little or
no toxicity. The dosage varies within this range depending upon the dosage form
employed, the sensitivity of the patient, and the route of administration.
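
As a worked illustration of the therapeutic index, the following sketch uses the conventional definition, LD50 divided by ED50, under which a larger value indicates a wider margin between effective and lethal doses. The function name and the numerical doses are hypothetical and are not data from this specification.

```python
# Minimal sketch of the therapeutic index under the conventional definition
# LD50 / ED50. Both doses must be expressed in the same units (e.g., mg/kg).
# The example values below are invented for illustration.


def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return LD50 / ED50; larger values mean a wider safety margin."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive doses")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical compound: lethal to 50% at 500 mg/kg, effective in 50% at 10 mg/kg.
    print(therapeutic_index(ld50=500.0, ed50=10.0))
```

A composition with an index of 50 in this sketch would be preferred over one with an index of 5, all else being equal, because the effective dose lies further below the lethal dose.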

The exact dosage will be determined by the practitioner, in light of factors related to
the subject requiring treatment. Dosage and administration are adjusted to provide
sufficient levels of the active moiety or to maintain the desired effect. Factors which may
be taken into account include the severity of the disease state, the general health of the
subject, the age, weight, and gender of the subject, time and frequency of administration,
drug combination(s), reaction sensitivities, and response to therapy. Long-acting
pharmaceutical compositions may be administered every 3 to 4 days, every week, or
biweekly depending on the half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total
dose of about 1 gram, depending upon the route of administration. Guidance as to
particular dosages and methods of delivery is provided in the literature and generally
available to practitioners in the art. Those skilled in the art will employ different
formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of
polynucleotides or polypeptides will be specific to particular cells, conditions, locations,
etc.

DIAGNOSTICS 

In another embodiment, antibodies which specifically bind REC may be used for the
diagnosis of disorders characterized by expression of REC, or in assays to monitor patients
being treated with REC or agonists, antagonists, or inhibitors of REC. Antibodies useful
for diagnostic purposes may be prepared in the same manner as described above for
therapeutics. Diagnostic assays for REC include methods which utilize the antibody and a
label to detect REC in human body fluids or in extracts of cells or tissues. The antibodies
may be used with or without modification, and may be labeled by covalent or non-covalent
attachment of a reporter molecule. A wide variety of reporter molecules, several of which
are described above, are known in the art and may be used.

A variety of protocols for measuring REC, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of REC
expression. Normal or standard values for REC expression are established by combining
body fluids or cell extracts taken from normal mammalian subjects, preferably human, with
antibody to REC under conditions suitable for complex formation. The amount of standard
complex formation may be quantitated by various methods, preferably by photometric
means. Quantities of REC expressed in subject, control, and disease samples from biopsied
tissues are compared with the standard values. Deviation between standard and subject
values establishes the parameters for diagnosing disease.

In another embodiment of the invention, the polynucleotides encoding REC may be
used for diagnostic purposes. The polynucleotides which may be used include
oligonucleotide sequences, complementary RNA and DNA molecules, and PNAs (Nielsen,
P.E. et al. (1993) Anticancer Drug Des. 8:53-63). The polynucleotides may be used to
detect and quantitate gene expression in biopsied tissues in which expression of REC may
be correlated with disease. The diagnostic assay may be used to determine absence,
presence, and excess expression of REC, and to monitor regulation of REC levels during
therapeutic intervention.

In one aspect, hybridization with PCR probes which are capable of detecting
polynucleotide sequences, including genomic sequences, encoding REC or closely related
molecules may be used to identify nucleic acid sequences which encode REC. The
specificity of the probe, whether it is made from a highly specific region, e.g., the 5'
regulatory region, or from a less specific region, e.g., a conserved motif, and the stringency
of the hybridization or amplification (maximal, high, intermediate, or low), will determine
whether the probe identifies only naturally occurring sequences encoding REC, allelic
variants, or related sequences.

Probes may also be used for the detection of related sequences, and should
preferably have at least 50% sequence identity to any of the REC encoding sequences. The
hybridization probes of the subject invention may be DNA or RNA and may be derived
from the sequence of SEQ ID NOs:17-32, or from genomic sequences including promoters,
enhancers, and introns of the REC gene.
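
The 50% identity criterion for probes can be illustrated with a simple calculation. The sketch below assumes the probe and target have already been aligned to equal length without gaps, which is a simplification; the function names, the threshold parameter, and the example sequences are hypothetical and are not drawn from SEQ ID NOs:17-32.

```python
# Minimal sketch: percent identity between two pre-aligned, ungapped,
# equal-length nucleotide sequences. Real comparisons would normally use an
# alignment program; this only illustrates the arithmetic of the criterion.


def percent_identity(probe: str, target: str) -> float:
    """Return the percentage of aligned positions at which the bases match."""
    if not probe or len(probe) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(probe.upper(), target.upper()))
    return 100.0 * matches / len(probe)


def meets_identity_threshold(probe: str, target: str, minimum: float = 50.0) -> bool:
    """True if the probe shares at least `minimum` percent identity with the target."""
    return percent_identity(probe, target) >= minimum


if __name__ == "__main__":
    # Hypothetical 8-base sequences: 5 of 8 positions match.
    print(percent_identity("ACGTACGT", "ACGTTTTT"))
    print(meets_identity_threshold("ACGTACGT", "ACGTTTTT"))
```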

Means for producing specific hybridization probes for DNAs encoding REC include
the cloning of polynucleotide sequences encoding REC or REC derivatives into vectors for
the production of mRNA probes. Such vectors are known in the art, are commercially
available, and may be used to synthesize RNA probes in vitro by means of the addition of
the appropriate RNA polymerases and the appropriate labeled nucleotides. Hybridization
probes may be labeled by a variety of reporter groups, for example, by radionuclides such
as 32P or 35S, or by enzymatic labels, such as alkaline phosphatase coupled to the probe via
avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding REC may be used for the diagnosis of a disorder
associated with expression of REC. Examples of such a disorder include, but are not
limited to, a neoplastic disorder, such as, adenocarcinoma, leukemia, lymphoma,
melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal
gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis,
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immunological
disorder, such as, acquired immunodeficiency syndrome (AIDS), Addison's disease, adult
respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia,
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis,
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins,
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis,
Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis,
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis,
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's
syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis,
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of
cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic,
protozoal, and helminthic infections, and trauma; a reproductive disorder, such as, disorders
of prolactin production; infertility, including tubal disease, ovulatory defects, and
endometriosis; disruptions of the estrous cycle, disruptions of the menstrual cycle,
polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian
tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis;
cancer of the breast, fibrocystic breast disease, and galactorrhea; disruptions of
spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the prostate,
benign prostatic hyperplasia, prostatitis, Peyronie's disease, carcinoma of the male breast,
and gynecomastia; a gastrointestinal disorder, such as, dysphagia, peptic esophagitis,
esophageal spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion,
gastritis, gastric carcinoma, anorexia, nausea, emesis, gastroparesis, antral or pyloric edema,
abdominal angina, pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal
tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic
carcinoma, biliary tract disease, hepatoma, infectious colitis, ulcerative colitis, ulcerative
proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, colonic carcinoma,
colonic obstruction, irritable bowel syndrome, short bowel syndrome, diarrhea,
constipation, gastrointestinal hemorrhage, and acquired immunodeficiency syndrome
(AIDS) enteropathy, cirrhosis, jaundice, cholestasis, hereditary hyperbilirubinemia, hepatic
encephalopathy, hepatorenal syndrome, hepatitis, hepatic steatosis, hemochromatosis,
Wilson's disease, alpha1-antitrypsin deficiency, Reye's syndrome, primary sclerosing
cholangitis, liver infarction, portal vein obstruction and thrombosis, passive congestion,
centrilobular necrosis, peliosis hepatis, hepatic vein thrombosis, veno-occlusive disease,
preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas;
a nervous disorder, such as, akathesia, Alzheimer's disease, amnesia, amyotrophic lateral
sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia, depression, diabetic
neuropathy, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's
disease, peripheral neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's disease,
paranoid psychoses, postherpetic neuralgia, schizophrenia, and Tourette's disorder; a
smooth muscle disorder, such as, angina, anaphylactic shock, arrhythmias, asthma,
cardiovascular shock, Cushing's syndrome, hypertension, hypoglycemia, myocardial
infarction, migraine, and pheochromocytoma, and myopathies including cardiomyopathy,
encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic disorder, and
ophthalmoplegia. Smooth muscle includes, but is not limited to, that of the blood vessels,
gastrointestinal tract, heart, and uterus; and a musculoskeletal disorder, such as, Duchenne's
muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core
disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial
myopathy, infectious myositis, polymyositis, dermatomyositis, inclusion body myositis,
thyrotoxic myopathy, and ethanol myopathy. The polynucleotide sequences encoding REC
may be used in Southern or Northern analysis, dot blot, or other membrane-based
technologies; in PCR technologies; in dipstick, pin, and ELISA assays; and in microarrays
utilizing fluids or tissues from patients to detect altered REC expression. Such qualitative
or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding REC may be useful in
assays that detect the presence of associated disorders, particularly those mentioned above.
The nucleotide sequences encoding REC may be labeled by standard methods and added to

WO 99/57270                                                    PCT/US99/09191

a fluid or tissue sample from a patient under conditions suitable for the formation of
hybridization complexes. After a suitable incubation period, the sample is washed and the
signal is quantitated and compared with a standard value. If the amount of signal in the
patient sample is significantly altered in comparison to a control sample then the presence
of altered levels of nucleotide sequences encoding REC in the sample indicates the presence
of the associated disorder. Such assays may also be used to evaluate the efficacy of a
particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor
the treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression
of REC, a normal or standard profile for expression is established. This may be
accomplished by combining body fluids or cell extracts taken from normal subjects, either
animal or human, with a sequence, or a fragment thereof, encoding REC, under conditions
suitable for hybridization or amplification. Standard hybridization may be quantified by
comparing the values obtained from normal subjects with values from an experiment in
which a known amount of a substantially purified polynucleotide is used. Standard values
obtained in this manner may be compared with values obtained from samples from patients
who are symptomatic for a disorder. Deviation from standard values is used to establish the
presence of a disorder.
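
The comparison of a patient value against standard values may be sketched as follows. The specification does not state how large a deviation must be, so the cutoff used here (two sample standard deviations from the mean of the normal subjects) is purely an assumption for illustration, as are the function names and the signal values.

```python
# Minimal sketch: establish a standard range from normal-subject signals and
# flag a patient signal that deviates from it. The two-standard-deviation
# cutoff is an assumed convention, not a criterion stated in the text.
from statistics import mean, stdev


def standard_range(normal_values, n_sd=2.0):
    """Return (low, high) bounds: mean +/- n_sd sample standard deviations."""
    if len(normal_values) < 2:
        raise ValueError("at least two normal values are required")
    center = mean(normal_values)
    spread = stdev(normal_values)
    return center - n_sd * spread, center + n_sd * spread


def deviates_from_standard(patient_value, normal_values, n_sd=2.0):
    """True if the patient value falls outside the standard range."""
    low, high = standard_range(normal_values, n_sd)
    return patient_value < low or patient_value > high


if __name__ == "__main__":
    # Hypothetical hybridization signals (arbitrary units) from normal subjects.
    normals = [1.0, 1.2, 0.9, 1.1, 0.8]
    print(standard_range(normals))
    print(deviates_from_standard(2.0, normals))
    print(deviates_from_standard(1.1, normals))
```

The same comparison could be repeated on successive samples from a treated patient to see whether the value returns to the standard range.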

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of
expression in the patient begins to approximate that which is observed in the normal
subject. The results obtained from successive assays may be used to show the efficacy of
treatment over a period ranging from several days to months.

With respect to cancer, the presence of a relatively high amount of transcript in
biopsied tissue from an individual may indicate a predisposition for the development of the
disease, or may provide a means for detecting the disease prior to the appearance of actual
clinical symptoms. A more definitive diagnosis of this type may allow health professionals
to employ preventative measures or aggressive treatment earlier thereby preventing the
development or further progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences
encoding REC may involve the use of PCR. These oligomers may be chemically
synthesized, generated enzymatically, or produced in vitro. Oligomers will preferably
contain a fragment of a polynucleotide encoding REC, or a fragment of a polynucleotide
complementary to the polynucleotide encoding REC, and will be employed under
optimized conditions for identification of a specific gene or condition. Oligomers may also
be employed under less stringent conditions for detection or quantitation of closely related
DNA or RNA sequences.

Methods which may also be used to quantitate the expression of REC include
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol.
Methods 159:235-244; and Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) The speed of
quantitation of multiple samples may be accelerated by running the assay in an ELISA
format where the oligomer of interest is presented in various dilutions and a
spectrophotometric or colorimetric response gives rapid quantitation.
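
Interpolation from a standard curve can be sketched as below. The sketch assumes piecewise-linear interpolation between measured standards, which is a simplification; the cited methods may fit a different curve shape. The function name, the absorbance readings, and the quantities are hypothetical.

```python
# Minimal sketch: estimate an unknown quantity from its measured signal by
# piecewise-linear interpolation between standards of known quantity.
# Linear interpolation is an assumption made for illustration.
from bisect import bisect_left


def interpolate_quantity(signal, standards):
    """standards: iterable of (signal, quantity) pairs with distinct signals."""
    points = sorted(standards)
    if not points:
        raise ValueError("at least one standard is required")
    signals = [s for s, _ in points]
    if not signals[0] <= signal <= signals[-1]:
        # Extrapolating beyond the measured standards is not attempted here.
        raise ValueError("signal lies outside the range of the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return points[i][1]
    (s0, q0), (s1, q1) = points[i - 1], points[i]
    return q0 + (q1 - q0) * (signal - s0) / (s1 - s0)


if __name__ == "__main__":
    # Hypothetical standards: (absorbance, quantity in ng).
    curve = [(0.25, 1.0), (0.5, 5.0), (1.0, 10.0)]
    print(interpolate_quantity(0.375, curve))
```

Refusing to extrapolate is a deliberate choice in this sketch: a sample whose signal falls outside the standards would normally be diluted or re-run rather than estimated.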

In further embodiments, oligonucleotides or longer fragments derived from any of
the polynucleotide sequences of the Sequence Listing may be used as targets in a
microarray. The microarray can be used to monitor the expression level of large numbers
of genes simultaneously and to identify genetic variants, mutations, and polymorphisms.
This information may be used to determine gene function, to understand the genetic basis of
a disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic
agents.

Microarrays may be prepared, used, and analyzed using methods known in the art.
(See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996)
Proc. Natl. Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application
WO95/251116; Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al.
(1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No.
5,605,662.)

In another embodiment of the invention, nucleic acid sequences encoding REC may
be used to generate hybridization probes useful in mapping the naturally occurring genomic
sequence. The sequences may be mapped to a particular chromosome, to a specific region
of a chromosome, or to artificial chromosome constructions, e.g., human artificial
chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial
chromosomes (BACs), bacterial P1 constructions, or single chromosome cDNA libraries
(Price, C.M. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends Genet. 7:149-154).

Fluorescent in situ hybridization (FISH) may be correlated with other physical
chromosome mapping techniques and genetic map data (Heinz-Ulrich, et al. (1995) in
Meyers, R.A. (ed.) Molecular Biology and Biotechnology, VCH Publishers New York NY,
pp. 965-968). Examples of genetic map data can be found in various scientific journals or
at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the location
of the gene encoding REC on a physical chromosomal map and a specific disorder, or a
predisposition to a specific disorder, may help define the region of DNA associated with
that disorder. The nucleotide sequences of the invention may be used to detect differences
in gene sequences among normal, carrier, and affected individuals.

In situ hybridization of chromosomal preparations and physical mapping techniques,
such as linkage analysis using established chromosomal markers, may be used for
extending genetic maps. Often the placement of a gene on the chromosome of another
mammalian species, such as mouse, may reveal associated markers even if the number or
arm of a particular human chromosome is not known. New sequences can be assigned to
chromosomal arms by physical mapping. This provides valuable information to
investigators searching for disease genes using positional cloning or other gene discovery
techniques. Once the disease or syndrome has been crudely localized by genetic linkage to
a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences mapping
to that area may represent associated or regulatory genes for further investigation. (See,
e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the subject
invention may also be used to detect differences in the chromosomal location due to
translocation, inversion, etc., among normal, carrier, or affected individuals.

In another embodiment of the invention, REC, its catalytic or immunogenic
fragments, or oligopeptides thereof can be used for screening libraries of compounds in any
of a variety of drug screening techniques. The fragment employed in such screening may
be free in solution, affixed to a solid support, borne on a cell surface, or located
intracellularly. The formation of binding complexes between REC and the agent being
tested may be measured.

Another technique for drug screening provides for high throughput screening of
compounds having suitable binding affinity to the protein of interest (Geysen, et al. (1984)
PCT application WO84/03564). In this method, large numbers of different small test
compounds are synthesized on a solid substrate, such as plastic pins or some other surface.
The test compounds are reacted with REC, or fragments thereof, and washed. Bound REC
is then detected by methods well known in the art. Purified REC can also be coated directly
onto plates for use in the aforementioned drug screening techniques. Alternatively,
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid
support.

In another embodiment, one may use competitive drug screening assays in which
neutralizing antibodies capable of binding REC specifically compete with a test compound
for binding REC. In this manner, antibodies can be used to detect the presence of any
peptide which shares one or more antigenic determinants with REC.

In additional embodiments, the nucleotide sequences which encode REC may be
used in any molecular biology techniques that have yet to be developed, provided the new
techniques rely on properties of nucleotide sequences that are currently known, including,
but not limited to, such properties as the triplet genetic code and specific base pair
interactions.

The examples below are provided to illustrate the subject invention and are not
included for the purpose of limiting the invention.

EXAMPLES 

I. cDNA Library Construction 

Tissue Description 

The TBLYNOT01 library was constructed at Stratagene (STR937214) using RNA
isolated from a hybrid of T-B lymphoblasts from a leukemic cell line.

The HNT2NOT01 library was constructed at Stratagene (STR937230) using RNA
isolated from the hNT2 cell line derived from a human teratocarcinoma that exhibited
properties characteristic of a committed neuronal precursor at an early stage of
development.

The PROSTUT05 library was constructed using polyA RNA isolated from prostate
tumor tissue removed from a 69-year-old Caucasian male. Pathology indicated
adenofibromatous hyperplasia and adenocarcinoma (Gleason grade 3 and 4). The tumor
invaded the capsule but did not extend beyond it; perineural invasion was present. The
patient presented with elevated prostate specific antigen. Patient history included occlusion
of a leg vein, diverticuli of the colon, and a partial colectomy. Family history included
cardiovascular disease, multiple myeloma, hyperlipidemia, and rheumatoid arthritis.

The BRSTNOT05 library was constructed using polyA RNA isolated from
nontumorous breast tissue removed from a 58-year-old Caucasian female. Pathology for
the associated tumor tissue indicated multicentric invasive grade 4 lobular carcinoma.
Patient history included skin cancer, rheumatic heart disease, osteoarthritis, and
tuberculosis. Family history included cerebrovascular and cardiovascular disease, breast
and prostate cancer, and Type I diabetes.

The THYRNOT03 library was constructed using polyA RNA isolated from thyroid
tissue removed from a 28-year-old Caucasian female. Pathology indicated adenomatous
hyperplasia associated with follicular adenoma. Patient history included nonobstetrical
galactorrhea, anemia, and pure hypercholesterolemia. Family history included
hyperlipidemia, skin cancer, and neurotic depression.

The LUNGNOT14 library was constructed using polyA RNA isolated from
nontumorous lung tissue removed from a 47-year-old Caucasian male during a segmental
lung resection. Pathology of the associated tumor indicated a grade 4 adenocarcinoma and
calcified granuloma. Patient history included benign hypertension and chronic obstructive
pulmonary disease. Family history included cardiovascular disease and Type II diabetes.

The CONNNOT01 library was constructed using polyA RNA isolated from
mesentery fat tissue removed from a 71-year-old Caucasian male during a partial
colectomy. Patient history included diverticulosis and diverticulitis, cholecystectomy,
viral hepatitis, and a hemangioma. The patient was taking Tegretol (carbamazepine).
Family history included cardiovascular disease and extrinsic asthma.

The KERANOT02 library was constructed using polyA RNA isolated from a human
breast keratinocyte cell line (NHEK, Clontech).

The BEPINOT01 library was constructed using polyA RNA isolated from a
bronchial epithelium primary cell line derived from a 54-year-old Caucasian male (NHBE,
Clontech).

The BRSTNOT07 library was constructed using polyA RNA isolated from
nontumorous breast tissue removed from a 43-year-old Caucasian female. Pathology
indicated mildly proliferative fibrocystic changes with epithelial hyperplasia,
papillomatosis, and duct ectasia. The associated tumor tissue indicated invasive, grade 4
mammary adenocarcinoma with extensive comedo necrosis. Family history included
cardiovascular disease, epilepsy, and Type II diabetes.

The OVARNOT03 library was constructed using polyA RNA isolated from
nontumorous ovarian tissue removed from a 43-year-old Caucasian female. Pathology for
the associated tumor tissue indicated grade 2 mucinous cystadenocarcinoma. Patient
history included mitral valve disorder, pneumonia, and viral hepatitis. Family history
included cardiovascular and cerebrovascular disease and pancreatic, breast, and uterine
cancer.

The OVARNOT02 library was constructed using polyA RNA isolated from ovarian
tissue removed from a 59-year-old Caucasian female who died of a myocardial infarction.
Patient history included cardiovascular disease, hypercholesterolemia, hypotension, and
arthritis.

The ADRETUT06 library was constructed using polyA RNA isolated from adrenal
tumor tissue removed from a 57-year-old Caucasian female. Pathology indicated
pheochromocytoma. Patient history included cardiovascular and cerebrovascular disease,
type I diabetes, reflux esophagitis, and joint pain. Family history included cardiovascular
disease, type I diabetes, renal failure, and skin cancer.

The THYMFET02 library was constructed using polyA RNA isolated from thymus
tissue removed from a Caucasian female fetus who died at 17 weeks' gestation from
anencephaly.

The SKINNOT04 library was constructed using polyA RNA isolated from breast
skin tissue removed from a 70-year-old Caucasian female during a biopsy and resection.
Pathology for the associated tumor tissue indicated invasive grade 3 adenocarcinoma.
mRNA Isolation and Library Construction

RNA was purchased from Clontech (Palo Alto, CA) or isolated at Incyte from the
tissues described above. The tissue was homogenized, lysed, and extracted in phenol,
guanidinium isothiocyanate, or a suitable mixture of denaturants such as TRIZOL reagent
(Life Technologies), a monophasic solution of phenol and guanidine isothiocyanate. To
isolate RNA, lysate was centrifuged over a CsCl cushion, mixed with chloroform (1:5 v/v),
recovered in the aqueous phase and precipitated with isopropanol. Alternatively, lysate was
electrophoresed through an agarose gel, and RNA was collected using Whatman P81 paper
(Whatman, Lexington MA) and eluted. The eluted RNA was precipitated with sodium
acetate and ethanol. The precipitate was resuspended in RNase-free water. For some
libraries, RNA was treated with DNase; and for others, phenol extraction and precipitation
were repeated. For most libraries, poly(A+) RNA was isolated using oligo d(T)-coupled
paramagnetic particles (Promega), Oligotex resin or the Oligotex kit (QIAGEN Inc,
Chatsworth, CA), or the Stratagene RNA Isolation kit. Alternatively, RNA was isolated
directly from tissue lysates using the Ambion PolyA Quick kit (Ambion, Austin, TX).

The cDNA libraries were synthesized and constructed at Stratagene or at Incyte
according to procedures recommended in the UNIZAP vector (Stratagene) or
SUPERSCRIPT plasmid system (Life Technologies), both of which are based on methods
well known in the art (Ausubel, supra, units 5.1-6.6). Alternatively, cDNA libraries were
constructed by Stratagene using RNA provided by Incyte. Reverse transcription was
initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were
ligated to double stranded cDNA, and cDNA was digested with an appropriate restriction
enzyme(s). For most libraries, cDNA was size-selected (300-1000 bp) using Sephacryl
S1000 or Sepharose CL2B or CL4B column chromatography (Amersham Pharmacia
Biotech) or preparative agarose gel electrophoresis. cDNAs were ligated into a compatible
restriction enzyme site of the polylinker of a suitable plasmid, e.g., pBLUESCRIPT
(Stratagene), pSPORT 1 (Life Technologies), or pINCY1 (Incyte Pharmaceuticals Inc, Palo
Alto, CA). pINCY1 was amplified in JM109 cells and purified using the QiaQuick column
(QIAGEN Inc). Recombinant plasmids were transformed into competent E. coli cells, e.g.,
XL1-Blue, XL1-BlueMRF, or SOLR (Stratagene) or DH5α, DH10B, or ElectroMAX
DH10B (Life Technologies).

II. Isolation and Sequencing of cDNA Clones

Plasmids were recovered from host cells by in vivo excision (UniZAP vector
system, Stratagene) or by cell lysis. Plasmids were purified using the MAGIC MINIPREPS
DNA purification system (Promega, Madison, WI); Miniprep kit (Advanced Genetic
Technologies Corporation, Gaithersburg, MD); QIAwell-8 Plasmid, QIAwell PLUS DNA,
or QIAwell ULTRA DNA purification systems; or REAL Prep 96 plasmid kit (QIAGEN
Inc) using the recommended protocol. Following precipitation, plasmids were resuspended
in 0.1 ml of distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link
PCR (Rao, V.B. (1994) Anal. Biochem. 216:1-14) in a high-throughput format. Host cell
lysis and thermal cycling steps were carried out in a single reaction mixture. Samples were
processed and stored in 384-well plates (Genetix Ltd, Christchurch UK) and concentration
of amplified plasmid DNA was quantified fluorometrically using Pico Green Dye
(Molecular Probes, Eugene OR) and a Fluoroskan II fluorescence scanner (Labsystems Oy,
Helsinki, Finland).

III. Sequencing and Analysis

The cDNAs were prepared for sequencing using either an ABI Catalyst 800 (Perkin
Elmer) or a Hamilton Micro Lab 2200 (Hamilton, Reno, NV) in combination with Peltier
Thermal Cyclers (PTC200; MJ Research, Watertown MA). The cDNAs were sequenced on
the ABI 373 or 377 DNA Sequencing systems (Perkin Elmer) by the method of Sanger et
al. (1975; J. Mol. Biol. 94:441f) using standard ABI protocols and kits. In the alternative,
cDNAs may have been sequenced using solutions and dyes from Amersham Pharmacia
Biotech. Reading frame was determined using standard methods (Ausubel, supra).

The cDNA sequences presented by Incyte Clone number in the last column of Table
1 and the full length nucleotide and amino acid sequences disclosed in the Sequence Listing
were analyzed and characterized using several of the following programs (or algorithms)
and databases. For PFAM, scores >11 report a significant degree of correlation; and the
higher the value, the more homologous the query sequence is to members of the protein
family. HMM models which were used to identify and confirm signal sequences
(SIGPEPT), transmembrane domains (TM) and the receptors disclosed in the Sequence
Listing were developed with annotated sequences from the LIFESEQ database (Incyte
Pharmaceuticals, Palo Alto CA) and SwissProt database. BLAST and the derivation of
product score are described in Example IV below.

Program/algorithm   Databases   Description                       Useful Parameters

cDNAs
Smith Waterman      GenBank     Local alignment algorithm         min length = 49 nt
                                for homology searching            <12% uncalled bases
FASTA               GenBank     Fast nucleotide sequence
                                database searching program
                                for UNIX, VMS
BLAST               GenBank     Ultra-fast database               Log likelihood for
                                searching program for             matches is 10^… and
                                UNIX, VMS C source                homologs >10^…

Full Length
Phred                           Reads trace data from
                                sequencing runs, makes base
                                calls, produces quality scores
                                and DNA sequence
Phrap                           Quality-score based               match >56
                                assembly program for shotgun      score >120
                                sequences
CONSED                          Graphical tool for editing
                                PHRAP contigs
BLAST               GenBank     Ultra-fast database searching     score >100
                    SwissProt   program for UNIX, VMS             P <1e-5
                                C source
FASTX               GenBank     Fast amino acid sequence          log likelihood >17
                    SwissProt   database searching program
                                for UNIX, VMS
BLIMPS              BLOCKS      Weighted-matrix analyses used     >1300 strong
                    PRINTS      to predict protein                1000-1300 suggestive
                                classification                    P <1e-3
PFAM                PROSITE     Analyses 3-60 amino acid          >11 strong
                                long sequences which              8-10 suggestive
                                correspond to highly
                                conserved regions of a
                                protein family
HMM                 SwissProt   Hidden Markov Models analyze      Score >11
(SIGPEPT,                       primary structures of gene
TM, and                         families using probabilistic
Receptor)                       approaches and trained models
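
The score bands listed for BLIMPS and PFAM can be applied programmatically. The following is a minimal sketch only: the function name and the returned labels are illustrative, and because the table leaves PFAM scores between 10 and 11 unassigned, the sketch assumes any score at or above the lower bound that is not "strong" counts as "suggestive".

```python
# Score bands taken from the table above: (strong if score > first value,
# suggestive if score >= second value). Treating the unassigned PFAM range
# between 10 and 11 as "suggestive" is an assumption of this sketch.
SCORE_BANDS = {
    "PFAM": (11, 8),
    "BLIMPS": (1300, 1000),
}


def classify_score(program: str, score: float) -> str:
    """Return 'strong', 'suggestive', or 'below threshold' for a hit score."""
    strong_above, suggestive_from = SCORE_BANDS[program]
    if score > strong_above:
        return "strong"
    if score >= suggestive_from:
        return "suggestive"
    return "below threshold"


if __name__ == "__main__":
    for program, score in [("PFAM", 12.4), ("PFAM", 9.0), ("BLIMPS", 1100)]:
        print(program, score, classify_score(program, score))
```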



IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a
transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a
membrane on which RNAs from a particular cell type or tissue have been bound
(Sambrook, supra, ch. 7).

Analogous computer techniques applying BLAST are used to search for identical or
related molecules in nucleotide databases such as GenBank or the LIFESEQ database (Incyte
Pharmaceuticals). This analysis is much faster than multiple membrane-based
hybridizations. In addition, the sensitivity of the computer search can be modified to
determine whether any particular match is categorized as exact or similar.

The basis of the search is the product score, which is defined as:

(% sequence identity x % maximum BLAST score) / 100

The product score takes into account both the degree of similarity between two sequences
and the length of the sequence match. For example, with a product score of 40, the match
will be exact within a 1% to 2% error, and, with a product score of 70, the match will be
exact. Similar molecules are usually identified by selecting those which show product
scores between 15 and 40, although lower scores may identify related molecules.
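
The product score and the interpretation bands described above can be sketched as follows. This is an illustrative sketch, not the implementation used to generate the reported results: the function names are hypothetical, and the exact handling of the boundary values (70, 40, and 15) is an assumption, since the text gives only representative scores.

```python
def product_score(percent_identity: float, percent_max_blast_score: float) -> float:
    """(% sequence identity x % maximum BLAST score) / 100."""
    return percent_identity * percent_max_blast_score / 100.0


def describe_match(score: float) -> str:
    """Map a product score to the categories named in the text.

    Boundary handling (>=) is an assumption of this sketch.
    """
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"


if __name__ == "__main__":
    score = product_score(percent_identity=80, percent_max_blast_score=50)
    print(score, describe_match(score))
```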

The results of Northern analysis are reported as a list of libraries in which the
transcript encoding REC occurs. Abundance and percent abundance are also reported.
Abundance directly reflects the number of times a particular transcript is represented in a
cDNA library, and percent abundance is abundance divided by the total number of
sequences examined in the cDNA library.
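
As an illustration of the two reported quantities, the sketch below counts a transcript in a list of sequence identifiers from one library. The identifiers and the function name are hypothetical, and multiplying the ratio by 100 to express it as a percentage is an assumption about how the value is reported.

```python
from collections import Counter


def abundance_report(library_sequences, transcript):
    """Return (abundance, percent abundance) for one transcript in one library.

    library_sequences: one identifier per sequence examined in the cDNA library.
    """
    counts = Counter(library_sequences)
    abundance = counts[transcript]
    total = len(library_sequences)
    percent_abundance = 100.0 * abundance / total
    return abundance, percent_abundance


if __name__ == "__main__":
    # Hypothetical library: 3 reads of the transcript of interest among 10.
    library = ["REC-1"] * 3 + ["other"] * 7
    print(abundance_report(library, "REC-1"))
```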

V. Extension of REC Encoding Polynucleotides

The nucleic acid sequences of Incyte Clones 044150, 266775, 843183, 965938,
1441620, 1510911, 2022379, 2024312, 2057886, 2121924, 2122815, 2132179, 2326441,
2825826, 2936050, and 3428945 were used to design oligonucleotide primers for extending
partial nucleotide sequences to full length. For each nucleic acid sequence, one primer was
synthesized to initiate extension of an antisense polynucleotide, and the other was
synthesized to initiate extension of a sense polynucleotide. Primers were used to facilitate
the extension of the known sequence "outward" generating amplicons containing new
unknown nucleotide sequence for the region of interest. The initial primers were designed
from the cDNA using OLIGO 4.06 (National Biosciences, Plymouth, MN), or another
appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of
about 50% or more, and to anneal to the target sequence at temperatures of about 68°C to
about 72°C. Any stretch of nucleotides which would result in hairpin structures and
primer-primer dimerizations was avoided.
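
Two of the design criteria above, primer length and GC content, are simple to check in code. The sketch below is not a substitute for OLIGO 4.06 or a comparable program: it does not estimate annealing temperature or screen for hairpins and primer dimers, and the function names and example sequences are hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    sequence = sequence.upper()
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


def meets_length_and_gc(primer: str, min_len: int = 22, max_len: int = 30,
                        min_gc: float = 0.50) -> bool:
    """Check the 22 to 30 nucleotide length and 50% or more GC criteria only."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    candidate = "GC" * 12  # hypothetical 24-mer, for illustration only
    print(len(candidate), gc_fraction(candidate), meets_length_and_gc(candidate))
```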

Selected human cDNA libraries (Life Technologies) were used to extend the
sequence. If more than one extension is necessary or desired, additional sets of primers are
designed to further extend the known region.

High fidelity amplification was obtained by following the instructions for the XL-PCR
kit (Perkin Elmer) and thoroughly mixing the enzyme and reaction mix. PCR was
performed using the Peltier Thermal Cycler (MJ Research), beginning with 40 pmol of each
primer and the recommended concentrations of all other components of the kit, with the
following parameters:

Step 1    94°C for 1 min (initial denaturation)
Step 2    65°C for 1 min
Step 3    68°C for 6 min
Step 4    94°C for 15 sec
Step 5    65°C for 1 min
Step 7    Repeat steps 4 through 6 for an additional 15 cycles
Step 8    94°C for 15 sec
Step 9    65°C for 1 min
Step 10   68°C for 7:15 min
Step 11   Repeat steps 8 through 10 for an additional 12 cycles
Step 12   72°C for 8 min
Step 13   4°C (and holding)

A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by electrophoresis on
a low concentration (about 0.6% to 0.8%) agarose mini-gel to determine which reactions
were successful in extending the sequence. Bands thought to contain the largest products
were excised from the gel, purified using QIAQUICK (QIAGEN Inc.), and trimmed of
overhangs using Klenow enzyme to facilitate religation and cloning.

After ethanol precipitation, the products were redissolved in 13 µl of ligation
buffer, 1 µl T4-DNA ligase (15 units) and 1 µl T4 polynucleotide kinase were added, and the
mixture was incubated at room temperature for 2 to 3 hours, or overnight at 16°C.
Competent E. coli cells (in 40 µl of appropriate media) were transformed with 3 µl of
ligation mixture and cultured in 80 µl of SOC medium. (See, e.g., Sambrook, supra,
Appendix A, p. 2.) After incubation for one hour at 37°C, the E. coli mixture was plated on
Luria Bertani (LB) agar (See, e.g., Sambrook, supra, Appendix A, p. 1) containing
carbenicillin (2x carb). The following day, several colonies were randomly picked from
each plate and cultured in 150 µl of liquid LB/2x carb medium placed in an individual well
of an appropriate commercially-available sterile 96-well microtiter plate. The following
day, 5 µl of each overnight culture was transferred into a non-sterile 96-well plate and, after
dilution 1:10 with water, 5 µl from each sample was transferred into a PCR array.

For PCR amplification, 18 µl of concentrated PCR reaction mix (3.3x) containing 4
units of rTth DNA polymerase, a vector primer, and one or both of the gene specific
primers used for the extension reaction was added to each well. Amplification was
performed using the following conditions:

Step 1    94°C for 60 sec
Step 2    94°C for 20 sec
Step 3    55°C for 30 sec
Step 4    72°C for 90 sec
Step 5    Repeat steps 2 through 4 for an additional 29 cycles
Step 6    72°C for 180 sec
Step 7    4°C (and holding)
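
For planning purposes, the programmed hold time of the colony-screening amplification above can be totaled as in the following sketch. The representation of the program as (temperature, seconds) pairs is an illustrative choice, the 90 second extension at 72°C is a reading of a poorly legible source line and should be treated as an assumption, and ramp times between temperatures are not included.

```python
# Colony-screening PCR program as (temperature in deg C, hold time in seconds).
# The 90 s extension at 72 deg C is an assumed reading of a garbled source line.
INITIAL_DENATURATION = [(94, 60)]
CYCLE = [(94, 20), (55, 30), (72, 90)]
FINAL_EXTENSION = [(72, 180)]
CYCLES = 1 + 29  # first pass through steps 2-4 plus 29 additional repeats


def hold_seconds(steps):
    """Sum the hold times of a list of (temperature, seconds) steps."""
    return sum(seconds for _temperature, seconds in steps)


def total_programmed_seconds():
    """Total programmed hold time, excluding ramping and the final 4 deg C hold."""
    return (hold_seconds(INITIAL_DENATURATION)
            + CYCLES * hold_seconds(CYCLE)
            + hold_seconds(FINAL_EXTENSION))


if __name__ == "__main__":
    print(total_programmed_seconds() / 60, "minutes of programmed hold time")
```

Under these assumptions the program holds for 4440 seconds, or 74 minutes, before the final 4°C hold.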


Aliquots of the PCR reactions were run on agarose gels together with molecular
weight markers. The sizes of the PCR products were compared to the original partial
cDNAs, and appropriate clones were selected, ligated into plasmid, and sequenced.

In like manner, the nucleotide sequences of SEQ ID NOs:17-32 are used to obtain
5' regulatory sequences using the procedure above, oligonucleotides designed for 5'
extension, and an appropriate genomic library.

VI. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NOs:17-32 are employed to screen
cDNAs, genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting
of about 20 base pairs, is specifically described, essentially the same procedure is used with
larger nucleotide fragments. Oligonucleotides are designed using state-of-the-art software
such as OLIGO 4.06 software (National Biosciences) and labeled by combining 50 pmol of
each oligomer, 250 µCi of [γ-32P] adenosine triphosphate (Amersham Pharmacia Biotech),
and T4 polynucleotide kinase (DuPont NEN, Boston, MA). The labeled oligonucleotides
are substantially purified using a SEPHADEX G-25 superfine size exclusion dextran bead
column (Pharmacia & Upjohn, Kalamazoo, MI). An aliquot containing 10^… counts per
minute of the labeled probe is used in a typical membrane-based hybridization analysis of
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco
RI, Pst I, Xba I, or Pvu II (DuPont NEN, Boston, MA).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to
nylon membranes (Nytran Plus, Schleicher & Schuell, Durham, NH). Hybridization is
carried out for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially
washed at room temperature under increasingly stringent conditions up to 0.1 x saline
sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR film (Kodak,
Rochester, NY) is exposed to the blots for several hours, hybridization patterns are
compared visually.

VII. Microarrays
Membrane Preparation
A single 22 x 22 cm nylon membrane suitable for standard hybridization protocols
is spotted with plant cDNA clones as follows. The clones are robotically picked and
arrayed into 384-well culture dishes. The cultures are gridded, using a Q-Bot robot
(Genetix Ltd), onto the nylon membrane at a density of 36,864 spots per membrane or
18,394 individual genes and 38 controls spotted in duplicate. These membranes are used in
standard hybridization protocols described below.
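
The spot count stated above follows directly from the number of genes and controls spotted in duplicate, as this short check shows. The function name is illustrative, and the plate count derived at the end is an inference from the 384-well format rather than a figure given in the text.

```python
def total_spots(genes: int, controls: int, replicates: int = 2) -> int:
    """Number of spots when every gene and control is spotted `replicates` times."""
    return (genes + controls) * replicates


if __name__ == "__main__":
    spots = total_spots(genes=18_394, controls=38)
    print(spots)         # spots per membrane
    print(spots // 384)  # equivalent number of full 384-well dishes (inferred)
```

With 18,394 genes and 38 controls in duplicate, the total is 36,864 spots, which corresponds to 96 full 384-well dishes.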

Several membranes are placed on LB plates with carbenicillin in bioassay trays and
grown for about 16 hours at 42°C, after which the membranes are placed (colony side up)
for 4 minutes on top of Whatman filter paper (Whatman Inc, Lexington MA) previously
saturated with prewarmed (95°C to 100°C) denaturing buffer (1.5M NaCl, 0.5M NaOH).
Excess denaturing buffer is removed, and the membranes are saturated for 4 minutes with
neutralizing buffer (1.5M NaCl, 1M Tris (Tris[hydroxymethyl]aminomethane) pH 8.0) by
placing them (colony side up) on top of Whatman filter paper (Whatman, Inc) previously
saturated with neutralizing buffer. The membranes are dried until no liquid is visible on
their surfaces.

Next the membranes are submerged, colony side down, in 100 ml prewarmed (42°C)
proteinase K buffer which consists of 0.1 M NaCl, 50 mM EDTA pH 8.5, 50 mM Tris
pH 8.0, Sarkosyl (1% N-lauroyl sarcosine), and 1 mg/ml proteinase K (Sigma). After one
hour, the membranes are retrieved and placed on Whatman filter paper (Whatman, Inc) to
dry overnight. Finally, the membranes are exposed to UV light (254 nm for 40 seconds) in
a GS Gene Linker UV Chamber (Bio-Rad Laboratories, Hercules CA) which cross-links the
DNA to the membranes.
Probe Preparation

Five µg mRNA and 2 µl random hexamer (0.5 mg/ml; Life Technologies) are
combined in a 1.5 ml RNase-free microcentrifuge tube. The sample is incubated at 70°C
for 10 minutes, placed on ice for five minutes, lyophilized to dryness, and then dissolved in
the following: 1.6 µl 5x first strand buffer, 0.8 µl 0.1 M DTT, 0.4 µl 10 mM dA/dG/dT
mix, 4.0 µl [32P] dCTP (3000 Ci/mmol, 10 µCi/µl) and 1.2 µl SuperScript II RT (200 U/µl;
Life Technologies).

The sample is centrifuged and incubated at 42°C for 1 to 2 hours and then diluted
with 42 µl of sterile water. Unincorporated nucleotides are removed with a ProbeQuant
G-50 Microcolumn (Amersham Pharmacia Biotech). The purified sample is boiled at 95°C
for 3 minutes and then put on ice. To degrade mRNA, 12.5 µl of 1N NaOH are added to
the sample which then is incubated at 37°C for 10 minutes. The sample is neutralized by
addition of 12.5 µl 1M Tris pH 6.8 and 10 µl 1M HCl. Degraded RNA is removed with a
ProbeQuant G-50 Microcolumn (supra).
Hybridization

The hybridization procedure described by Soares is followed (Soares et al. Proc.
Natl. Acad. Sci. (1994) 91:9228-9232). Ten ml prewarmed (42°C) hybridization buffer
(0.75 M NaCl, 0.1 M NaPO4, 0.1% (w/v) Na4P2O7, 0.15 M Tris pH 7.5, 5x Denhardt
solution (Ausubel, supra), 2% sodium dodecyl sulfate (SDS), sheared salmon testes DNA
(100 µg/ml), 50% formamide) are added to the membranes in hybridization bags for greater
than 2 hours to overnight for prehybridization. Radiolabelled probe (32P) is added to a new
10 ml aliquot of the prewarmed hybridization buffer, and hybridization is allowed to
proceed at 42°C for 14 to 16 hours.

After hybridization, membranes are rinsed with 200 ml 2x SSC at room
temperature for 5 minutes, washed once with prewarmed 2x SSC plus 1% SDS for 20
minutes at 68°C, and then washed two more times with prewarmed 0.6x SSC plus 1% SDS
for 30 minutes at 68°C. Damp membranes are exposed to X-OMAT autoradiography film
(Kodak) for two nights in a PhosphorImager cassette (Molecular Dynamics) and developed.

VIII. Complementary Polynucleotides

Sequences complementary to the REC-encoding sequences, or any parts thereof,
are used to detect, decrease, or inhibit expression of naturally occurring REC. Although use
of oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the
same procedure is used with smaller or with larger sequence fragments. Appropriate
oligonucleotides are designed using OLIGO 4.06 software and the coding sequence of
REC. To inhibit transcription, a complementary oligonucleotide is designed from the most
unique 5' sequence and used to prevent promoter binding to the coding sequence. To
inhibit translation, a complementary oligonucleotide is designed to prevent ribosomal
binding to the REC-encoding transcript.
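
The core operation in designing a complementary oligonucleotide is taking the reverse complement of the chosen target region. The sketch below illustrates only that step: the example sequence is hypothetical, the function names are illustrative, and taking the first bases of the coding sequence stands in for the selection of the "most unique 5' sequence", which in practice would be done with OLIGO 4.06 or similar software.

```python
# Translation table for DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return sequence.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_sequence: str, length: int = 20) -> str:
    """Oligonucleotide complementary to the first `length` bases (5' end).

    Using the very first bases is a simplifying assumption of this sketch.
    """
    return reverse_complement(coding_sequence[:length])


if __name__ == "__main__":
    hypothetical_coding = "ATGGCCAAGTTCGGACTGAACCTGTAA"  # illustrative only
    print(antisense_oligo(hypothetical_coding, length=15))
```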

IX. Expression of REC

Expression and purification of REC is achieved using bacterial or virus-based
expression systems. For expression of REC in bacteria, cDNA is subcloned into an
appropriate vector containing an antibiotic resistance gene and an inducible promoter that
directs high levels of cDNA transcription. Examples of such promoters include, but are not
limited to, the trp-lac (tac) hybrid promoter and the T5 or T7 bacteriophage promoter in
conjunction with the lac operator regulatory element. Recombinant vectors are transformed
into suitable bacterial hosts, e.g., BL21(DE3). Antibiotic resistant bacteria express REC
upon induction with isopropyl beta-D-thiogalactopyranoside (IPTG). Expression of REC in
eukaryotic cells is achieved by infecting insect or mammalian cell lines with recombinant
Autographa californica nuclear polyhedrosis virus (AcMNPV), commonly known as
baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA
encoding REC by either homologous recombination or bacterial-mediated transposition
involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is
used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes,
in some cases. Infection of the latter requires additional genetic modifications to
baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227;
Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.)

In most expression systems, REC is synthesized as a fusion protein with, e.g.,
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His,
permitting rapid, single-step, affinity-based purification of recombinant fusion protein from
crude cell lysates. GST, a 26-kilodalton enzyme from Schistosoma japonicum, enables the
purification of fusion proteins on immobilized glutathione under conditions that maintain
protein activity and antigenicity (Pharmacia, Piscataway, NJ). Following purification, the
GST moiety can be proteolytically cleaved from REC at specifically engineered sites.
FLAG, an 8-amino acid peptide, enables immunoaffinity purification using commercially
available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak, Rochester,
NY). 6-His, a stretch of six consecutive histidine residues, enables purification on
metal-chelate resins (QIAGEN Inc, Chatsworth, CA). Methods for protein expression and
purification are discussed in Ausubel, F. M. et al. (1995 and periodic supplements) Current
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, ch. 10, 16. Purified
REC obtained by these methods can be used directly in the following activity assay.
X. Demonstration of REC Activity 

An assay for REC activity is based on a prototypical assay for ligand-receptor
activity. This assay measures the stimulation of DNA synthesis in Swiss mouse 3T3 cells.
A plasmid containing polynucleotides encoding REC is added to quiescent 3T3 cultured
cells using transfection methods well known in the art, and the transfected cells are then
incubated in the presence of [3H]thymidine, a radioactive DNA precursor. Varying
amounts of REC ligand are then added to the cultured cells. Incorporation of [3H]thymidine
into acid-precipitable DNA is measured over an appropriate time interval using a
radioisotope counter, and the amount incorporated is directly proportional to the amount of
newly synthesized DNA. A linear dose-response curve over at least a hundred-fold REC
ligand concentration range is indicative of receptor activity. One unit of activity per
milliliter is defined as the concentration of REC producing a 50% response level, where
100% represents maximal incorporation of [3H]thymidine into acid-precipitable DNA.
(McKay, I. and Leigh, I., eds. (1993) Growth Factors: A Practical Approach, Oxford
University Press, New York, NY, page 73.)
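
The 50% response level that defines one unit of activity can be estimated from dose-response data by interpolation. The following sketch is one possible way to do this and is not taken from the cited reference: it assumes responses rise with concentration, takes the largest measured incorporation as 100%, and interpolates linearly on a log10 concentration scale. The function name and the data values are hypothetical.

```python
import math


def concentration_at_half_max(concentrations, responses):
    """Estimate the concentration giving 50% of the maximal response.

    Assumes concentrations are positive and ascending, and that responses
    (e.g. counts per minute of incorporated thymidine) increase with dose.
    Interpolation on a log10 concentration scale is an assumption of this sketch.
    """
    target = 0.5 * max(responses)
    points = list(zip(concentrations, responses))
    for (c_low, r_low), (c_high, r_high) in zip(points, points[1:]):
        if r_low <= target <= r_high and r_high != r_low:
            fraction = (target - r_low) / (r_high - r_low)
            log_c = math.log10(c_low) + fraction * (math.log10(c_high) - math.log10(c_low))
            return 10 ** log_c
    raise ValueError("50% response level is not bracketed by the data")


if __name__ == "__main__":
    # Hypothetical dose-response data spanning a hundred-fold range.
    doses = [1.0, 10.0, 100.0]
    counts = [0.0, 250.0, 1000.0]
    print(concentration_at_half_max(doses, counts))
```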

XI. Functional Assays

REC function is assessed by expressing the sequences encoding REC at
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into
a mammalian expression vector containing a strong promoter that drives high levels of
cDNA expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR
3.1 (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg
of recombinant vector are transiently transfected into a human cell line, preferably of
endothelial or hematopoietic origin, using either liposome formulations or electroporation.
1-2 µg of an additional plasmid containing sequences encoding a marker protein are
co-transfected. Expression of a marker protein provides a means to distinguish transfected
cells from nontransfected cells and is a reliable predictor of cDNA expression from the
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein
(GFP) (Clontech, Palo Alto, CA), CD64, or a CD64-GFP fusion protein. Flow cytometry
(FCM), an automated, laser optics-based technique, is used to identify transfected cells
expressing GFP or CD64-GFP, and to evaluate properties, for example, their apoptotic
state. FCM detects and quantifies the uptake of fluorescent molecules that diagnose events
preceding or coincident with cell death. These events include changes in nuclear DNA
content as measured by staining of DNA with propidium iodide; changes in cell size and
granularity as measured by forward light scatter and 90 degree side light scatter;
down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake;
alterations in expression of cell surface and intracellular proteins as measured by reactivity
with specific antibodies; and alterations in plasma membrane composition as measured by
the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in
flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry, Oxford, New
York, NY.

The influence of REC on gene expression can be assessed using highly purified
populations of cells transfected with sequences encoding REC and either CD64 or
CD64-GFP. CD64 and CD64-GFP are expressed on the surface of transfected cells and bind
to conserved regions of human immunoglobulin G (IgG). Transfected cells are efficiently
separated from nontransfected cells using magnetic beads coated with either human IgG or
antibody against CD64 (DYNAL, Lake Success, NY). mRNA can be purified from the
cells using methods well known by those of skill in the art. Expression of mRNA
encoding REC and other genes of interest can be analyzed by Northern analysis or
microarray techniques.

XII. Production of REC Specific Antibodies

REC substantially purified using polyacrylamide gel electrophoresis (PAGE) (see,
e.g., Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification
techniques, is used to immunize rabbits and to produce antibodies using standard protocols.
Alternatively, the REC amino acid sequence is analyzed using LASERGENE
NAVIGATOR software (DNASTAR Inc.) to determine regions of high immunogenicity,
and a corresponding oligopeptide is synthesized and used to raise antibodies by means
known to those of skill in the art. Methods for selection of appropriate epitopes, such as
those near the C-terminus or in hydrophilic regions, are well described in the art. (See, e.g.,
Ausubel, supra, ch. 11.)

Typically, oligopeptides 15 residues in length are synthesized using an Applied
Biosystems Peptide Synthesizer Model 431A using Fmoc chemistry and coupled to KLH
(Sigma, St. Louis, MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester
(MBS) to increase immunogenicity. (See, e.g., Ausubel, supra.) Rabbits are immunized
with the oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are
tested for antipeptide activity by, for example, binding the peptide to plastic, blocking with
1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat
anti-rabbit IgG.

XIII. Purification of Naturally Occurring REC Using Specific Antibodies

Naturally occurring or recombinant REC is substantially purified by
immunoaffinity chromatography using antibodies specific for REC. An immunoaffinity
column is constructed by covalently coupling anti-REC antibody to an activated
chromatographic resin, such as CNBr-activated Sepharose (Pharmacia & Upjohn). After
the coupling, the resin is blocked and washed according to the manufacturer's instructions.
Media containing REC are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of REC (e.g., high ionic
strength buffers in the presence of detergent). The column is eluted under conditions that
disrupt antibody/REC binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a
chaotrope, such as urea or thiocyanate ion), and REC is collected.

XIV. Identification of Molecules Which Interact with REC

REC, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules
previously arrayed in the wells of a multi-well plate are incubated with the labeled REC,
washed, and any wells with labeled REC complex are assayed. Data obtained using
different concentrations of REC are used to calculate values for the number, affinity, and
association of REC with the candidate molecules.

Various modifications and variations of the described methods and systems of the
invention will be apparent to those skilled in the art without departing from the scope and
spirit of the invention. Although the invention has been described in connection with
specific preferred embodiments, it should be understood that the invention as claimed
should not be unduly limited to such specific embodiments. Indeed, various modifications
of the described modes for carrying out the invention which are obvious to those skilled in
molecular biology or related fields are intended to be within the scope of the following
claims.
What is claimed is:

1. A substantially purified polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, and fragments thereof.

2. A substantially purified variant having at least 90% amino acid identity to
the amino acid sequence of claim 1.

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1.

4. An isolated and purified polynucleotide variant having at least 90%
polynucleotide sequence identity to the polynucleotide of claim 3.

5. An isolated and purified polynucleotide which hybridizes under stringent
conditions to the polynucleotide of claim 3.

6. An isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide sequence of claim 3.

7. An isolated and purified polynucleotide comprising a polynucleotide
sequence selected from the group consisting of SEQ ID NO:17, SEQ ID NO:18, SEQ ID
NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID
NO:30, SEQ ID NO:31, SEQ ID NO:32, and fragments thereof.

8. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide sequence identity to the polynucleotide of claim 7. 

9. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 7. 

10. An expression vector comprising at least a fragment of the polynucleotide of
claim 3.

11. A host cell comprising the expression vector of claim 10.

12. A method for producing a polypeptide comprising the amino acid sequence
selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15,
SEQ ID NO:16, and fragments thereof, the method comprising the steps of:

a) culturing the host cell of claim 11 under conditions suitable for the
expression of the polypeptide; and

b) recovering the polypeptide from the host cell culture.

13. A pharmaceutical composition comprising the polypeptide of claim 1 in
conjunction with a suitable pharmaceutical carrier.

14. A purified antibody which specifically binds to the polypeptide of claim 1. 

15. A purified agonist of the polypeptide of claim 1. 

16. A purified antagonist of the polypeptide of claim 1.

17. A method for treating or preventing a neoplastic disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

18. A method for treating or preventing an immunological disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

19. A method for treating or preventing a reproductive disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

20. A method for treating or preventing a gastrointestinal disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

21. A method for treating or preventing a nervous disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

22. A method for treating or preventing a smooth muscle disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

23. A method for treating or preventing a musculoskeletal disorder, the method
comprising administering to a subject in need of such treatment an effective amount of the
antagonist of claim 16.

24. A method for detecting a polynucleotide encoding the polypeptide
comprising the amino acid sequence selected from the group consisting of SEQ ID NO:1,
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, and fragments thereof in a
biological sample, the method comprising the steps of:

(a) hybridizing the polynucleotide of claim 6 to at least one of the
nucleic acids in the biological sample, thereby forming a hybridization complex;
and

(b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of the polynucleotide encoding
the polypeptide in the biological sample.

25. The method of claim 24 wherein the nucleic acids of the biological sample
are amplified by the polymerase chain reaction prior to the hybridizing step.
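
Claims 2, 4, and 8 recite a threshold of at least 90% sequence identity. As a minimal sketch of what such a threshold check involves, the following assumes the two sequences have already been aligned to equal length (with "-" marking gaps) and counts identical positions over all alignment columns. This is only one of several conventions for percent identity; the definition that governs the claims is the one given in the specification, which lies outside this section. The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Both inputs must be pre-aligned to the same length; '-' denotes a gap.
    Gap columns count toward the denominator but never as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True when the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # First ten residues of SEQ ID NO:1 in one-letter code versus a
    # made-up variant with a single substitution at the last position.
    reference = "MAAPEAWRAR"
    variant = "MAAPEAWRAK"
    print(percent_identity(reference, variant))
    print(meets_identity_threshold(reference, variant))
```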
SEQUENCE LISTING

<110> INCYTE PHARMACEUTICALS, INC.
HILLMAN, Jennifer L.
BANDMAN, Olga
TANG, Y. Tom
YUE, Henry
LAL, Preeti
CORLEY, Neil C.
GUEGLER, Karl J.
PATTERSON, Chandra

<120> HUMAN RECEPTOR MOLECULES

<130> PF-0516 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 09/071,822 
<151> 1998-05-01 

<160> 32 

<170> Perl Program 

<210> 1 

<211> 610 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte clone 044150 

<400> 1 



Met Ala Ala Pro Glu Ala Trp Arg Ala Arg Ser Cys Trp Phe Cys
1 5 10 15

Glu Val Ala Ala Ala Thr Thr Met Glu Ala Thr Ser Arg Glu Ala
20 25 30

Ala Pro Ala Lys Ser Ser Ala Ser Gly Pro Asn Ala Pro Pro Ala
35 40 45

Leu Phe Glu Leu Cys Gly Arg Ala Val Ser Ala His Met Gly Val
50 55 60

Leu Glu Ser Gly Val Trp Ala Leu Pro Gly Pro Ile Leu Gln Ser
65 70 75

Ile Leu Pro Leu Leu Asn Ile Tyr Tyr Leu Glu Arg Ile Glu Glu
80 85 90

Thr Ala Leu Lys Lys Gly Leu Ser Thr Gln Ala Ile Trp Arg Arg
95 100 105

Leu Trp Asp Glu Leu Met Lys Thr Arg Pro Ser Ser Leu Glu Ser
110 115 120

Val Thr Cys Trp Arg Ala Lys Phe Met Glu Ala Phe Phe Ser His
125 130 135

Val Leu Arg Gly Thr Ile Asp Val Ser Ser Asp Arg Arg Leu Cys
140 145 150

Asp Gln Arg Phe Ser Pro Ser Ala Pro Ala Ala Thr Ser Ser Ala
155 160 165

Ser Ser Ser Thr Ser Ser Tyr Lys Arg Ala Pro Ala Ser Ser Ala
170 175 180

Pro Gln Pro Lys Pro Leu Lys Arg Phe Lys Arg Ala Ala Gly Lys
185 190 195

Lys Gly Ala Arg Thr Arg Gln Gly Pro Gly Ala Glu Ser Glu Asp
200 205 210

Leu Tyr Asp Phe Val Phe Ile Val Ala Gly Glu Lys Glu Asp Gly
215 220 225

Glu Glu Met Glu Ile Gly Glu Val Ala Cys Gly Ala Leu Asp Gly
230 235 240

Ser Asp Pro Ser Cys Leu Gly Leu Pro Ala Leu Glu Ala Ser Gln
245 250 255

Arg Phe Arg Ser Ile Ser Thr Leu Glu Leu Phe Thr Val Pro Leu
260 265 270

Ser Thr Glu Ala Ala Leu Thr Leu Cys His Leu Leu Ser Ser Trp
275 280 285

Val Ser Leu Glu Ser Leu Thr Leu Ser Tyr Asn Gly Leu Gly Ser
290 295 300

Asn Ile Phe Arg Leu Leu Asp Ser Leu Arg Ala Leu Ser Gly Gln
305 310 315

Ala Gly Cys Arg Leu Arg Ala Leu His Leu Ser Asp Leu Phe Ser
320 325 330

Pro Leu Pro Ile Leu Glu Leu Thr Arg Ala Ile Val Arg Ala Leu
335 340 345

Pro Leu Leu Arg Val Leu Ser Ile Arg Val Asp His Pro Ser Gln
350 355 360

Arg Asp Asn Pro Gly Val Pro Gly Asn Ala Gly Pro Pro Ser His
365 370 375

Ile Ile Gly Asp Glu Glu Ile Pro Glu Asn Cys Leu Glu Gln Leu
380 385 390

Glu Met Gly Phe Pro Arg Gly Ala Gln Pro Ala Pro Leu Leu Cys
395 400 405

Ser Val Leu Lys Ala Ser Gly Ser Leu Gln Gln Leu Ser Leu Asp
410 415 420

Ser Ala Thr Phe Ala Ser Pro Gln Asp Phe Gly Leu Val Leu Gln
425 430 435

Thr Leu Lys Glu Tyr Asn Leu Ala Leu Lys Arg Leu Ser Phe His
440 445 450

Asp Met Asn Leu Ala Asp Cys Gln Ser Glu Val Leu Phe Leu Leu
455 460 465

Gln Asn Leu Thr Leu Gln Glu Ile Thr Phe Ser Phe Cys Arg Leu
470 475 480

Phe Glu Lys Arg Pro Ala Gln Phe Leu Pro Glu Met Val Ala Ala
485 490 495

Met Lys Gly Asn Ser Thr Leu Lys Gly Leu Arg Leu Pro Gly Asn
500 505 510

Arg Leu Gly Asn Ala Gly Leu Leu Ala Leu Ala Asp Val Phe Ser
515 520 525

Glu Asp Ser Ser Ser Ser Leu Cys Gln Leu Asp Ile Ser Ser Asn
530 535 540

Cys Ile Lys Pro Asp Gly Leu Leu Glu Phe Ala Lys Arg Leu Glu
545 550 555

Arg Trp Gly Arg Gly Ala Phe Gly His Leu Arg Leu Phe Gln Asn
560 565 570

Trp Leu Asp Gln Asp Ala Val Thr Ala Arg Glu Ala Ile Arg Arg
575 580 585

Leu Arg Ala Thr Cys His Val Val Ser Asp Ser Trp Asp Ser Ser
590 595 600

Gln Ala Phe Ala Asp Tyr Val Ser Thr Met
605 610



<210> 2 
<211> 275 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 266775

<400> 2

Met Glu Ala Lys Thr Arg Glu Leu Ile Ala Arg Arg Thr Thr Pro
1 5 10 15

Leu Leu Glu Tyr Ile Lys Asn Arg Lys Leu Glu Lys Gln Arg Ile
20 25 30

Arg Glu Glu Lys Arg Glu Glu Arg Arg Arg Arg Glu Leu Glu Lys
35 40 45

Lys Arg Leu Arg Glu Glu Glu Lys Arg Arg Arg Arg Glu Glu Glu
50 55 60

Arg Cys Lys Lys Lys Glu Thr Asp Lys Gln Lys Lys Ile Ala Glu
65 70 75

Lys Glu Val Arg Ile Lys Leu Leu Lys Lys Pro Glu Lys Gly Glu
80 85 90

Glu Pro Thr Thr Glu Lys Pro Lys Glu Arg Gly Glu Glu Ile Asp
95 100 105

Thr Gly Gly Gly Lys Gln Glu Ser Cys Ala Pro Gly Ala Val Val
110 115 120

Lys Ala Arg Pro Met Glu Gly Ser Leu Glu Glu Pro Gln Glu Thr
125 130 135

Ser His Ser Gly Ser Asp Lys Glu His Arg Asp Val Glu Arg Ser
140 145 150

Gln Glu Gln Glu Ser Glu Ala Gln Arg Tyr His Val Asp Asp Gly
155 160 165

Arg Arg His Arg Ala His His Glu Pro Glu Arg Leu Ser Arg Arg
170 175 180

Ser Glu Asp Glu Gln Arg Trp Gly Lys Gly Pro Gly Gln Asp Arg
185 190 195

Gly Lys Lys Gly Ser Gln Asp Ser Gly Ala Pro Gly Glu Ala Met
200 205 210

Glu Arg Leu Gly Arg Ala Gln Arg Cys Asp Asp Ser Pro Ala Pro
215 220 225

Arg Lys Glu Arg Leu Ala Asn Lys Asp Arg Pro Ala Leu Gln Leu
230 235 240

Tyr Asp Pro Gly Ala Arg Phe Arg Ala Arg Glu Cys Gly Gly Asn
245 250 255

Arg Arg Ile Cys Lys Ala Glu Gly Ser Gly Thr Gly Pro Glu Lys
260 265 270

Arg Glu Glu Ala Glu
275
<210> 3
<211> 147

<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte clone 843183

<400> 3

Met Cys Phe Leu Met Ile Phe Thr Phe Leu Val Cys Trp Met Pro
1 5 10 15

Tyr Ile Val Ile Cys Phe Leu Val Val Asn Gly His Gly His Leu
20 25 30

Val Hit Pro Thr Ile Ser Ile Val Ser Tyr Leu Phe Ala Lys Ser
35 40 45

Asn Thr Val Tyr Asn Pro Val Ile Tyr Val Phe Met Ile Arg Lys
50 55 60

Phe Arg Arg Ser Leu Leu Gln Leu Leu Cys Leu Arg Leu Leu Arg
65 70 75

Cys Gln Arg Pro Ala Lys Asp Leu Pro Ala Ala Gly Ser Glu Met
80 85 90

Gln Ile Arg Pro Ile Val Met Ser Gln Lys Asp Gly Asp Arg Pro
95 100 105

Lys Lys Lys Val Thr Phe Asn Ser Ser Ser Ile Ile Phe Ile Ile
110 115 120

Thr Ser Asp Glu Ser Leu Ser Val Asp Asp Ser Asp Lys Thr Asn
125 130 135

Gly Ser Lys Val Asp Val Ile Gln Val Arg Pro Leu
140 145



<210> 4 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 965938

<400> 4

Met Asp Cys Val Ile Thr Gly Arg Pro Cys Cys Ile Gly Thr Lys
1 5 10 15

Gly Arg Cys Glu Ile Thr Ser Arg Glu Tyr Cys Asp Phe Met Arg
20 25 30

Gly Tyr Phe His Glu Glu Ala Thr Leu Cys Ser Gln Val His Cys
35 40 45

Met Asp Asp Val Cys Gly Leu Leu Pro Phe Leu Asn Pro Glu Val
50 55 60

Pro Asp Gln Phe Tyr Arg Leu Trp Leu Ser Leu Phe Leu His Ala
65 70 75

Gly Ile Leu His Cys Leu Val Ser Ile Cys Phe Gln Met Thr Val
80 85 90

Leu Arg Asp Leu Glu Lys Leu Ala Gly Trp His Arg Ile Ala Ile
95 100 105

Ile Tyr Leu Leu Ser Gly Val Thr Gly Asn Leu Ala Ser Ala Ile
110 115 120

Phe Leu Pro Tyr Arg Ala Glu Val Gly Pro Ala Gly Ser Gln Phe
125 130 135

Gly Ile Leu Ala Cys Leu Phe Val Glu Leu Phe Gln Ser Trp Ala
140 145 150

Asp Pro Gly Arg Gly Pro Gly Val Pro Ser Ser Ser Cys Trp Leu
155 160 165

Trp Cys Ser Ser Ser Ser Pro Leu Gly Cys Cys Arg Gly Leu Thr
170 175 180

Thr Leu Pro Thr Ser Arg Gly Ser Ser Val Ala Ser Ser Ser Pro
185 190 195

Ser Pro Ser Cys Pro Thr Ser Ala Leu Ala Ser Ser Thr Cys Thr
200 205 210

Gly Asn Ala Ala Arg Ser Ser Ser Phe Arg Trp Ser Ser Trp Ala
215 220 225

Ser Trp Leu Ala Trp Trp Ser Ser Ser Thr Ser Ile Leu Ser Ala
230 235 240

Val Ser Gly Val Ser Ser Ser Pro Ala Ser Pro Ser Leu Thr Ser
245 250 255

Ser Val Arg Ser Thr Asn Trp Thr Leu Ser Ser Thr Glu Leu Ala
260 265 270

Ala Gly Ser Ser Gly Arg Val Leu Gln Gln Ala Arg Ala Arg His
275 280 285

Asp Leu Pro Glu Pro His Arg Leu Thr Gly Val Thr Cys Ser Met
290 295 300

Trp Gly Leu Ala Cys Phe Leu Asn Thr Asp Leu Phe Leu Val Pro
305 310 315

Cys Ser Leu Leu Leu Asn Pro Ser Tyr Cys Arg Ala Phe Ile Ile
320 325 330

Leu Leu Pro Val Ile Thr
335











<210> 5 
<211> 213 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 1441620

<400> 5

Met Ala Ile Thr Gln Phe Arg Leu Phe Lys Phe Cys Thr Cys Leu
1 5 10 15

Ala Thr Val Phe Ser Phe Leu Lys Arg Leu Ile Cys Arg Ser Gly
20 25 30

Arg Gly Arg Lys Leu Ser Gly Asp Gln Ile Thr Leu Pro Thr Thr
35 40 45

Val Asp Tyr Ser Ser Val Pro Lys Gln Thr Asp Val Glu Glu Trp
50 55 60

Thr Ser Trp Asp Glu Asp Ala Pro Thr Ser Val Lys Ile Glu Gly
65 70 75

Gly Asn Gly Asn Val Ala Thr Gln Gln Asn Ser Leu Glu Gln Leu
80 85 90

Glu Pro Asp Tyr Phe Lys Asp Met Thr Pro Thr Ile Arg Lys Thr
95 100 105

Gln Lys Ile Val Ile Lys Lys Arg Glu Pro Leu Asn Phe Gly Ile
110 115 120

Pro Asp Gly Ser «r Gly Phe Ser Ser Arg Leu Ala Ala Thr Gln
125 130 135

Asp Leu Pro Phe Ile His Gln Ser Ser Glu Leu Gly Asp Leu Asp
140 145 150

Thr Trp Gln Glu Asn Thr Asn Ala Trp Glu Glu Glu Glu Asp Ala
155 160 165

Ala Trp Gln Ala Glu Glu Val Leu Arg Gln Gln Lys Leu Ala Asp
170 175 180

Arg Glu Lys Arg Ala Ala Glu Gln Gln Arg Lys Lys Met Glu Lys
185 190 195

Glu Ala Gln Arg Leu Met Lys Lys Glu Gln Asn Lys Ile Gly Val
200 205 210

Lys Leu Ser



<210> 6 

<211> 338 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 1510911

<400> 6

Met Thr Ala Leu Ser Ser Glu Asn Cys Ser Phe Gln Tyr Gln Leu
1 5 10 15

Arg Gln Thr Asn Gln Pro Leu Asp Val Asn Tyr Leu Leu Phe Leu
20 25 30

Ile Ile Leu Gly Lys Ile Leu Leu Asn Ile Leu Thr Leu Gly Met
35 40 45

Arg Arg Lys Asn Thr Cys Gln Asn Phe Met Glu Tyr Phe Cys Ile
50 55 60

Ser Leu Ala Phe Val Asp Leu Leu Leu Leu Val Asn Ile Ser Ile
65 70 75

Ile Leu Tyr Phe Arg Asp Phe Val Leu Leu Ser Ile Arg Phe Thr
80 85 90

Lys Tyr His Ile Cys Leu Phe Thr Gln Ile Ile Ser Phe Thr Tyr
95 100 105

Gly Phe Leu His Tyr Pro Val Phe Leu Thr Ala Cys Ile Asp Tyr
110 115 120

Cys Leu Asn Phe Ser Lys Thr Thr Lys Leu Ser Phe Lys Cys Gln
125 130 135

Lys Leu Phe Tyr Phe Phe Thr Val Ile Leu Ile Trp Ile Ser Val
140 145 150

Leu Ala Tyr Val Leu Gly Asp Pro Ala Ile Tyr Gln Ser Leu Lys
155 160 165

Ala Gln Asn Ala Tyr Ser Arg His Cys Pro Phe Tyr Val Ser Ile
170 175 180

Gln Ser Tyr Trp Leu Ser Phe Phe Met Val Met Ile Leu Phe Val
185 190 195

Ala Phe Ile Thr Cys Trp Glu Glu Val Thr Thr Leu Val Gln Ala
200 205 210

Ile Arg Ile Thr Ser Tyr Met Asn Glu Thr Ile Leu Tyr Phe Pro
215 220 225

Phe Ser Ser His Ser Ser Tyr Thr Val Arg Ser Lys Lys Ile Phe
230 235 240

Leu Ser Lys Leu Ile Val Cys Phe Leu Ser Thr Trp Leu Pro Phe
245 250 255

Val Leu Leu Gln Val Ile Ile Val Leu Leu Lys Val Gln Ile Pro
260 265 270

Ala Tyr Ile Glu Met Asn Ile Pro Trp Leu Tyr Phe Val Asn Ser
275 280 285

Phe Leu Ile Ala Thr Val Tyr Trp Phe Asn Cys His Lys Leu Asn
290 295 300

Leu Lys Asp Ile Gly Leu Pro Leu Asp Pro Phe Val Asn Trp Lys
305 310 315

Cys Cys Phe Ile Pro Leu Thr Ile Pro Asn Leu Glu Gln Ile Glu
320 325 330

Lys Pro Ile Ser Ile Met Ile Cys
335

<210> 7 

<211> 326 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2022379

<400> 7

Met Glu Pro Lys Ala Ser Cys Pro Ala Ala Ala Pro Leu Met Glu
1 5 10 15

Arg Lys Phe His Val Leu Val Gly Val Thr Gly Ser Val Ala Ala
20 25 30

Leu Lys Leu Pro Leu Leu Val Ser Lys Leu Leu Asp Ile Pro Gly
35 40 45

Leu Glu Val Ala Val Val Thr Thr Glu Arg Ala Lys His Phe Tyr
50 55 60

Ser Pro Gln Asp Ile Pro Val Thr Leu Tyr Ser Asp Ala Asp Glu
65 70 75

Trp Glu Met Trp Lys Ser Arg Ser Asp Pro Val Leu His Ile Asp
80 85 90

Leu Arg Arg Trp Ala Asp Leu Leu Leu Val Ala Pro Leu Asp Ala
95 100 105

Asn Thr Leu Gly Lys Val Ala Ser Gly Ile Cys Asp Asn Leu Leu
110 115 120

Thr Cys Val Met Arg Ala Trp Asp Arg Ser Lys Pro Leu Leu Phe
125 130 135

Cys Pro Ala Met Asn Thr Ala Met Trp Glu His Pro Ile Thr Ala
140 145 150

Gln Gln Val Asp Gln Leu Lys Ala Phe Gly Tyr Val Glu Ile Pro
155 160 165

Cys Val Ala Lys Lys Leu Val Cys Gly Asp Glu Gly Leu Gly Ala
170 175 180

Met Ala Glu Val Gly Thr Ile Val Asp Lys Val Lys Glu Arg Pro
185 190 195

Leu Pro Ala Gln Trp Leu Pro Ala Glu Leu Thr Trp Asp Phe Cys
200 205 210

His Gly Cys Pro Ser Val Leu Arg Met Gly Ser Gly Gln Val Gly
215 220 225

Glu Asp Gly Cys Trp Gln Asn Arg Arg Ile Pro Ser Phe Ala Glu



230 235 240

WO 99/57270 PCT/US99/09191

Trp Gly Thr Cys Ser GTa Pro Ala Gln Gly Pro Gly Leu Leu Gln
245 250 255

Val Lys Leu Asp Gly Arg Pro Arg Ser Gln Phe Leu Ser Thr
260 265 270

Arg Gly Arg Cys Leu Glu Pro Leu Pro Thr Phe Ser Trp Met Gly
275 280 285

Glu Ala Ser Gln Glu Ser Lys Gln Cys Cys Pro His Gly Arg Arg
290 295 300

Thr Glu Arg Leu Gly Lys Leu Gly Ser Thr Ser His Pro Glu Arg
305 310 315

Leu Leu Glu Thr Pro Gln Leu Glu Ser Pro Gly
320 325







<210> 8 
<211> 529 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 2024312

<400> 8

Met Leu Val Leu Phe Glu Thr Ser Val Gly Tyr Ala Ile Phe Lys
1 5 10 15

Val Leu Asn Glu Lys Lys Leu Gln Glu Val Asp Ser Leu Trp Lys
20 25 30

Glu Phe Glu Thr Pro Glu Lys Ala Asn Lys Ile Val Lys Leu Lys
35 40 45

His Phe Glu Lys Phe Gln Asp Thr Ala Glu Ala Leu Ala Ala Phe
50 55 60

Thr Ala Leu Met Glu Gly Lys Ile Asn Lys Gln Leu Lys Lys Val
65 70 75

Leu Lys Lys Ile Val Lys Glu Ala His Glu Pro Leu Ala Val Ala
80 85 90

Asp Ala Lys Leu Gly Gly Val Ile Lys Glu Lys Leu Asn Leu Ser
95 100 105

Cys Ile His Ser Pro Val Val Asn Glu Leu Met Arg Gly Ile Arg
110 115 120

Ser Gln Met Asp Gly Leu Ile Pro Gly Val Glu Pro Arg Glu Met
125 130 135

Ala Ala Met Cys Leu Gly Leu Ala His Ser Leu Ser Arg Tyr Arg
140 145 150

Leu Lys Phe Ser Ala Asp Lys Val Asp Thr Met Ile Val Gln Ala
155 160 165

Ile Ser Leu Leu Asp Asp Leu Asp Lys Glu Leu Asn Asn Tyr Ile
170 175 180

Met Arg Cys Arg Glu Trp Tyr Gly Trp His Phe Pro Glu Leu Gly
185 190 195

Lys Ile Ile Ser Asp Asn Leu Thr Tyr Cys Lys Cys Leu Gln Lys
200 205 210

Val Gly Asp Arg Lys Asn Tyr Ala Ser Ala Lys Leu Ser Glu Leu
215 220 225

Leu Pro Glu Glu Val Glu Ala Glu Val Lys Ala Ala Ala Glu Ile
230 235 240

Ser Met Gly Thr Glu Val Ser Glu Glu Asp Ile Cys Asn Ile Leu
245 250 255

His Leu Cys Thr Gln Val Ile Glu Ile Ser Glu Tyr Arg Thr Gln
260 265 270

Leu Tyr Glu Tyr Leu Gln Asn Arg Met Met Ala Ile Ala Pro Asn
275 280 285

Val Thr Val Met Val Gly Glu Leu Val Gly Ala Arg Leu Ile Ala
290 295 300

His Ala Gly Ser Leu Leu Asn Leu Ala Lys His Ala Ala Ser Thr
305 310 315

Val Gln Ile Leu Gly Ala Glu Lys Ala Leu Phe Arg Ala Leu Lys
320 325 330

Ser Arg Arg Asp Thr Pro Lys Tyr Gly Leu Ile Tyr His Ala Ser
335 340 345

Leu Val Gly Gln Thr Ser Pro Lys His Lys Gly Lys Ile Ser Arg
350 355 360

Met Leu Ala Ala Lys Thr Val Leu Ala Ile Arg Tyr Asp Ala Phe
365 370 375

Gly Glu Asp Ser Ser Ser Ala Met Gly Val Glu Asn Arg Ala Lys
380 385 390

Leu Glu Ala Arg Leu Arg Thr Leu Glu Asp Arg Gly Ile Arg Lys
395 400 405

Ile Ser Gly Thr Gly Lys Ala Leu Ala Lys Thr Glu Lys Tyr Glu
410 415 420

His Lys Ser Glu Val Lys Thr Tyr Asp Pro Ser Gly Asp Ser Thr
425 430 435

Leu Pro Thr Cys Ser Lys Lys Arg Lys Ile Glu Gln Val Asp Lys
440 445 450

Glu Asp Glu Ile Thr Glu Lys Lys Ala Lys Lys Ala Lys Ile Lys
455 460 465

Val Lys Val Glu Glu Glu Glu Glu Glu Lys Val Ala Glu Glu Glu
470 475 480

Glu Thr Ser Val Lys Lys Lys Lys Lys Arg Gly Lys Lys Lys His
485 490 495

Ile Lys Glu Glu Pro Leu Ser Glu Glu Glu Pro Cys Thr Ser Thr
500 505 510

Ala Ile Ala Ser Pro Glu Lys Lys Lys Lys Lys Lys Lys Lys Arg
515 520 525

Glu Asn Glu Asp



<210> 9 
<211> 361 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 2057886

<400> 9

Met Arg Gly Gln Arg Ser Leu Leu Leu Gly Pro Ala Arg Leu Cys
1 5 10 15

Leu Arg Leu Leu Leu Leu Leu Gly Tyr Arg Arg Arg Cys Pro Pro
20 25 30

Leu Leu Arg Gly Leu Val Gln Arg Trp Arg Tyr Gly Lys Val Cys
35 40 45

Leu Arg Ser Leu Leu Tyr Asn Ser Phe Gly Gly Ser Asp Thr Ala
50 55 60

Val Asp Ala Ala Phe Glu Pro Val Tyr Trp Leu Val Asp Asn Val
65 70 75

Ile Arg Trp Phe Gly Val Val Phe Val Val Leu Val Ile Val Leu
80 85 90

Thr Gly Ser Ile Val Ala Ile Ala Tyr Leu Cys Val Leu Pro Leu
95 100 105

Ile Leu Arg Thr Tyr Ser Val Pro Arg Leu Cys Trp His Phe Phe
110 115 120

Tyr Ser His Trp Asn Leu Ile Leu Ile Val Phe His Tyr Tyr Gln
125 130 135

Ala Ile Thr Thr Pro Pro Gly Tyr Pro Pro Gln Gly Arg Asn Asp
140 145 150

Ile Ala Thr Val Ser Ile Cys Lys Lys Cys Ile Tyr Pro Lys Pro
155 160 165

Ala Arg Thr His His Cys Ser Ile Cys Asn Arg Cys Val Leu Lys
170 175 180

Met Asp His His Cys Pro Trp Leu Asn Asn Cys Val Gly His Tyr
185 190 195

Asn His Arg Tyr Phe Phe Ser Phe Cys Phe Phe Met Thr Leu Gly
200 205 210

Cys Val Tyr Cys Ser Tyr Gly Ser Trp Asp Leu Phe Arg Glu Ala
215 220 225

Tyr Ala Ala Ile Glu Thr Tyr His Gln Thr Pro Pro Pro Thr Phe
230 235 240

Ser Phe Arg Glu Arg Met Thr His Lys Ser Leu Val Tyr Leu Trp
245 250 255

Phe Leu Cys Ser Ser Val Ala Leu Ala Leu Gly Ala Leu Thr Val
260 265 270

Trp His Ala Val Leu Ile Ser Arg Gly Glu Thr Ser Ile Glu Arg
275 280 285

His Ile Asn Lys Lys Glu Arg Arg Arg Leu Gln Ala Lys Gly Arg
290 295 300

Val Phe Arg Asn Pro Tyr Asn Tyr Gly Cys Leu Asp Asn Trp Lys
305 310 315

Val Phe Leu Gly Val Asp Thr Gly Arg His Trp Leu Thr Arg Val
320 325 330

Leu Leu Pro Ser Ser His Leu Pro His Gly Asn Gly Met Ser Trp
335 340 345

Glu Pro Pro Pro Trp Val Thr Ala His Ser Ala Ser Val Met Ala
350 355 360

Val



<210> 10 

<211> 361 

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte clone 2121924

<400> 10

Met Phe Ala Lys '&BT'l*y9 Gly Ser Ala Val Pro Ser Asp Gly Gln
1 5 10 15

Ala Arg Glu Lys Leu Ala Leu Tyr Val Tyr Glu Tyr Leu Leu His
20 25 30

Val Gly Ala Gln Lys Ser Ala Gln Thr Phe Leu Ser Glu Ile Arg
35 40 45

Trp Glu Lys Asn Ile Thr Leu Gly Glu Pro Pro Gly Phe Leu His
50 55 60

Ser Trp Trp Cys Val Phe Trp Asp Leu Tyr Cys Ala Ala Pro Glu
65 70 75

Arg Arg Asp Thr Cys Glu His Ser Ser Glu Ala Lys Ala Phe His
80 85 90

Asp Tyr Ser Ala Ala Ala Ala Pro Ser Pro Val Leu Gly Asn Ile
95 100 105

Pro Pro Asn Asp Gly Met Pro Gly Gly Pro Ile Pro Pro Gly Phe
110 115 120

Phe Gln Pro Phe Met Ser Pro Arg Tyr Ala Gly Gly Pro Arg Pro
125 130 135

Pro Ile Arg Met Gly Asn Gln Pro Pro Gly Gly Val Pro Gly Thr
140 145 150

Gln Pro Leu Leu Pro Asn Ser Met Asp Pro Thr Arg Gln Gln Gly
155 160 165

His Pro Asn Met Gly Gly Ser Met Gln Arg Met Asn Pro Pro Arg
170 175 180

Gly Met Gly Pro Met Gly Pro Gly Pro Gln Asn Tyr Gly Ser Gly
185 190 195

Met Arg Pro Pro Pro Asn Ser Leu Gly Pro Ala Met Pro Gly Ile
200 205 210

Asn Met Gly Pro Gly Ala Gly Arg Pro Trp Pro Asn Pro Asn Ser
215 220 225

Ala Asn Ser Ile Pro Tyr Ser Ser Ser Ser Pro Gly Thr Tyr Val
230 235 240

Gly Pro Pro Gly Gly Gly Gly Pro Pro Gly Thr Pro Ile Met Pro
245 250 255

Ser Pro Ala Asp Ser Thr Asn Ser Ser Asp Asn Ile Tyr Thr Met
260 265 270

Ile Asn Pro Val Pro Pro Gly Gly Ser Arg Ser Asn Phe Pro Met
275 280 285

Gly Pro Gly Ser Asp Gly Pro Met Gly Gly Met Gly Gly Met Glu
290 295 300

Pro His His Met Asn Gly Ser Leu Gly Ser Gly Asp Ile Asp Gly
305 310 315

Leu Pro Lys Asn Ser Pro Asn Asn Ile Ser Gly Ile Ser Asn Pro
320 325 330

Pro Gly Thr Pro Arg Asp Asp Gly Glu Leu Gly Gly Asn Phe Leu
335 340 345

His Ser Phe Gln Asn Asp Asn Tyr Ser Pro Ser Met Thr Met Ser
350 355 360

Val



<210> 11 

<211> 221 

<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte clone 2122815

<400> 11

Met Arg Gly Leu His Pro Trp His Val Leu Arg Arg Pro Leu Gly
1 5 10 15

Pro Gln Ala His Ala Asn Asp Pro Glu Cys Gly Gln Arg Pro Val
20 25 30

Pro Ala Leu Ser His His Gly Ser Gln Arg Val Val Leu Leu Gln
35 40 45

Thr Ala Thr Leu Leu Gly Val Leu Leu Leu Gly Tyr Gly Tyr Phe
50 55 60

Trp Leu Leu Val Pro Asn Pro Glu Ala Arg Leu Gln Gln Leu Gly
65 70 75

Leu Phe Cys Ser Val Phe Thr Ile Ser Met Tyr Leu Ser Pro Leu
80 85 90

Ala Asp Leu Ala Lys Val Ile Gln Thr Lys Ser Thr Gln Cys Leu
95 100 105

Ser Tyr Pro Leu Thr Ile Ala Thr Leu Leu Thr Ser Ala Ser Trp
110 115 120

Cys Leu Tyr Gly Phe Arg Leu Arg Asp Pro Tyr Ile Met Val Ser
125 130 135

Asn Phe Pro Gly Ile Val Thr Ser Phe Ile Arg Phe Trp Leu Phe
140 145 150

Trp Lys Tyr Pro Arg Ser Lys Thr Gly Thr Thr Gly Ser Cys Lys
155 160 165

Pro Glu Ala Ala His Leu Thr Thr Gly His Leu Ser Ala Asn Leu
170 175 180

Asn Gln Arg Asp Leu Leu Val Ser Ala Gly Pro Ala Val Gln Leu
185 190 195

Pro Arg Cys Ser Gly Leu Trp Glu Gln Glu Met Thr Leu Arg Ile
200 205 210

Lys Gly Pro Lys Lys Lys Leu Tyr Leu Asp Asp
215 220



<210> 12 
<211> 238 
<212> PRT 

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte clone 2132179

<400> 12

Met Ala Leu Val Pro Cys Gln Val Leu Arg Met Ala Ile Leu Leu
1 5 10 15

Ser Tyr Cys Ser Ile Leu Cys Asn Tyr Lys Ala Ile Glu Met Pro
20 25 30

Ser His Gln Thr Tyr Gly Gly Ser Trp Lys Phe Leu Thr Phe Ile
35 40 45

Asp Leu Val Ile Gln Ala Val Phe Phe Gly Ile Cys Val Leu Thr
50 55 60

Asp Leu Ser Ser Leu Leu Thr Arg Gly Ser Gly Asn Gln Glu Gln
65 70 75

Glu Arg Gln Leu Lys Lys Leu Ile Ser Leu Arg Asp Trp Met Leu
80 85 90

Ala Val Leu Ala Phe Pro Val Gly Val Phe Val Val Ala Val Phe
95 100 105

Trp Ile Ile Tyr Ala Tyr Asp Arg Glu Met Ile Tyr Pro Lys Leu
110 115 120

Leu Asp Asn Phe Ile Pro Gly Trp Leu Asn His Gly Met His Thr
125 130 135

Thr Val Leu Pro Phe Ile Leu Ile Glu Met Arg Thr Ser His His
140 145 150

Gln Tyr Pro Ser Arg Ser Ser Gly Leu Thr Ala Ile Cys Thr Phe
155 160 165

Ser Val Gly Tyr Ile Leu Trp Val Cys Trp Val His His Val Thr
170 175 180

Gly Met Trp Val Tyr Pro Phe Leu Glu His Ile Gly Pro Gly Ala
185 190 195

Arg Ile Ile Phe Phe Gly Ser Thr Thr Ile Leu Met Asn Phe Leu
200 205 210

Tyr Leu Leu Gly Glu Val Leu Asn Asn Tyr Ile Trp Asp Thr Gln
215 220 225

Lys Ser Met Glu Glu Glu Lys Glu Lys Pro Lys Leu Glu
230 235



<210> 13 

<211> 348 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2326441

<400> 13

Met Gly Ala Ala Cys Pro Leu Ser Ser Pro Val Tyr Ser Thr Pro
1 5 10 15

Pro Pro Trp Leu Trp Pro Trp Pro Thr Ser Met Gly Pro Gly Ser
20 25 30

Gly Arg Gly Thr Thr Ser Cys Ala Thr Pro Val Thr Ala Ala Ser
35 40 45

Trp Leu Ala Pro Ala Ser Met Leu Ala Cys Pro Gln Arg Asn Pro
50 55 60

Ser Thr Ser Ala Ala Gly Pro Arg Ile Met Lys Asp Leu Thr Cys
65 70 75

Arg Trp Thr Pro Gly Ala His Gly Glu Thr Phe Leu His Thr Asn
80 85 90

Tyr Ser Leu Lys Tyr Lys Leu Arg Trp Tyr Gly Gln Asp Asn Thr
95 100 105

Cys Glu Glu Tyr His Thr Val Gly Pro His Ser Cys His Ile Pro
110 115 120

Lys Asp Leu Ala Leu Phe Thr Pro Tyr Glu Ile Trp Val Glu Ala
125 130 135

Thr Asn Arg Leu Gly Ser Ala Arg Ser Asp Val Leu Thr Leu Asp
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140 




145 






150 


He 


Leu 


Asp V^l 


vai 


Thr Thr Asp Pro Pro 


Pro 


Asp Val 


His Val 








155 




160 






165 


Ser 


Arg 


Val Gly 


Gly 


Leu 


Glu Asp Gin Leu 


Ser 


Val Arg 


Trp Val 








170 




175 






180 


Ser 


Pro 


Pro Ala 


Leu 


Lys 


Asp Phe Leu Phe 


Gin 


Ala Lys 


Tyr Gin 








185 




190 






195 


He 


Arg 


Tyr Arg 


Val 


Glu 


Asp Ser Val Asp 


Trp 


Lys Val 


Val Asp 








200 




205 






210 


Asp 


Val 


Ser Asn 


Gin 


Thr 


Ser Cys Arg Leu 


Ala 


Gly Leu 


Lys Pro 








215 




220 






225 


Oly 


Thr 


Val Tyr 


Phe 


Val 


Gin Val Arg Cys 


Asn 


Pro Phe 


Gly He 








230 




235 






240 


Tyr 


Gly 


Ser Lys 


Lys 


Ala 


Gly He Trp Ser 


Glu 


Trp Ser 


His Pro 








245 




250 






255 


Tfar 


Ala 


Ala Ser 


Thr 


Pro 


Arci Ser Glu Ar<i 


Pro 


Gly Pro 


Gly Gly 








260 












Gly 


Ala 


Cys Glu 


Pro 




Glv Glv Glu Pro 


Ser 


Ser Gly 


Pro Val 








275 




280 










Arci 


Glu Leu 


Lvs 


Gin 


Phe Leu Gly Trp 


Ijeu 


TaVfl Ta/a 
<Uj^O JUjr O 


Bla Ala 

O l.S> MULCH 








290 




295 






300 


Tyr 


Cys 


Ser Asn 


Leu 


Ser 


Phe Arg Leu Tyr 


Asp 


Gin Trp 


Arg Ala 








305 




310 






315 


Trp 


Met 


Gin Lys 


Ser 


His 


Lys Thr Arg Asn 


Gin 


His Arg 


Thr Arg 








320 




325 






330 


Oly 


Ser 


Cys Pro 


Arg 


Ala 


Asp Gly Ala Arg 


Arg 


Glu Val 


Leu Pro 








335 




340 






345 


Asp 


Lys 


Leu 















<210> 14 
<211> 352 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 320 

<223> unknown, or other
<220> 

<221> misc_feature

<223> Incyte clone 2825826 



<400> 14 

Met Ser Met Leu Ala Glu Arg Arg Arg Lys Gln Lys Trp Ala Val
1 5 10 15

Asp Pro Gln Asn Thr Ala Trp Ser Asn Asp Asp Ser Lys Phe Gly
20 25 30

Gln Arg Met Leu Glu Lys Met Gly Trp Ser Lys Gly Lys Gly Leu
35 40 45

Gly Ala Gln Glu Gln Gly Ala Thr Asp His Ile Lys Val Gln Val
50 55 60

Lys Asn Asn His Leu Gly Leu Gly Ala Thr Ile Asn Asn Glu Asp
65 70 75

Asn Trp Ile Ala [illegible] [illegible] Asp Asp Phe Asn Gln Leu Leu Ala Glu
80 85 90

Leu Asn Thr Cys His Gly Gln Glu Thr Thr Asp Ser Ser Asp Lys
95 100 105

Lys Glu Lys Lys Ser Phe Ser Leu Glu Glu Lys Ser Lys Ile Ser
110 115 120

Lys Asn Arg Val His Tyr Met Lys Phe Thr Lys Gly Lys Asp Leu
125 130 135

Ser Ser Arg Ser Lys Thr Asp Leu Asp Cys Ile Phe Gly Lys Arg
140 145 150

Gln Ser Lys Lys Thr Pro Glu Gly Asp Ala Ser Pro Ser Thr Pro
155 160 165

Glu Glu Asn Glu Thr Thr Thr Thr Ser Ala Phe Thr Ile Gln Glu
170 175 180

Tyr Phe Ala Lys Arg Met Ala Ala Leu Lys Asn Lys Pro Gln Val
185 190 195

Pro Val Pro Gly Ser Asp Ile Ser Glu Thr Gln Val Glu Arg [illegible]
200 205 210

Arg Gly Lys Lys Arg Asn Lys Glu Ala Thr Gly Lys [illegible] Val Glu
215 220 225

[illegible] Tyr Leu Gln Pro Lys Ala Lys Arg His Thr Glu Gly Lys [illegible]
230 235 240

Glu Arg Ala Glu Ala Gln Glu Arg Val Ala Lys [illegible] [illegible] [illegible] [illegible]
245 250 255

Pro Ala Glu Glu Gln Leu [illegible] Gly Pro Cys [illegible] [illegible] [illegible] [illegible] [illegible]
260 265 270

Lys Ala Ser Ala Gln Asp Ala Gly Asp His Val Gln [illegible] [illegible] [illegible]
275 280 285

Gly Arg Asn [illegible] Thr Leu [illegible] [illegible] [illegible] [illegible] [illegible] [illegible] [illegible] [illegible] [illegible]
290 295 300

Lys Leu Gln Lys Pro Val Glu Ile Ala Glu Asp Ala Thr Leu Glu
305 310 315

Glu Thr Leu Val Xaa Lys Glu Glu Glu Glu Arg Phe Gln Met Asn
320 325 330

Pro Ser Gln Pro Gly Pro Ser Asp His Ser Ala Val Arg Ala Leu
335 340 345

Arg Gly Gln Thr Pro Leu Ala
350









<210> 15 
<211> 210 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 2936050 



<400> 15 

Met Gly Gly Gly Arg Gly Leu Leu Gly Arg Glu Thr Leu Gly Pro
1 5 10 15

Gly Gly Gly Cys Ser Gly Lys Ser Ser Leu Cys Tyr Trp Pro Pro
20 25 30

Leu Gly Ser Pro Gln Ala Pro Ser Leu Pro Arg Thr Leu Pro Leu
35 40 45

Glu Pro Pro Arg Cys [illegible] [illegible] Arg Ser Cys Pro Leu Pro Arg Ser
50 55 60

Ala Cys Leu Cys Ser Arg Asn Ser Ala Pro Gly Ser Cys Cys Ser
65 70 75

Ser Trp Ala Ala Leu Leu Ser Ala Leu Pro Pro Pro Ser Phe Ala
80 85 90

Ser Pro Ser Pro Ser Met His Ile Trp Thr Leu Ser Cys Thr Ser
95 100 105

Gly Ala Ser Trp Ala Pro Val Thr Tyr Trp Thr Asp His Pro Gln
110 115 120

Pro Leu Leu Pro Thr His Leu His Ser Ser Arg Leu Pro Ala Asn
125 130 135

Tyr Ile Ile Leu Pro Thr Asp Leu Arg Tyr His Cys His Arg His
140 145 150

Pro Pro His Leu Thr Asn Arg Leu Trp Leu Leu Val Met Trp Thr
155 160 165

His Leu Gly Gly Ile Arg Ala Gly His Ser Pro Trp Thr Val Ile
170 175 180

Gln Thr Ala Gly Arg Pro Pro Arg Ser Leu Ser Pro Ser Ala Arg
185 190 195

Pro Ile Ser Ser Pro Ser Pro Glu Thr Ser Cys Val Pro Ala Thr
200 205 210



<210> 16 
<211> 318 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 3428945 



<400> 16 



Met Gly Thr Ser Leu Leu Cys Trp Val Val Leu Gly Phe Leu Gly
1 5 10 15

Thr Asp Ser Val Ser Thr Asp His Thr Gly Ala Gly Val Ser Gln
20 25 30

Ser Pro Arg Tyr Lys Val Thr Lys Arg Gly Gln Asp Val Thr Leu
35 40 45

Arg Cys Asp Pro Ile Ser Ser His Ala Thr Leu Tyr Trp Tyr Gln
50 55 60

Gln Ala Leu Gly Gln Gly Pro Glu Phe Leu Thr Tyr Phe Asn Tyr
65 70 75

Glu Ala Gln Pro Asp Lys Ser Gly Leu Pro Ser Asp Arg Phe Ser
80 85 90

Ala Glu Arg Pro Glu Gly Ser Ile Ser Thr Leu Thr Ile Gln Arg
95 100 105

Thr Glu Gln Arg Asp Ser Ala Met Tyr Arg Cys Ala Ser Ser Leu
110 115 120

Ala Thr Gly Gly Thr Gly Glu Leu Phe Phe Gly Glu Gly Ser Arg
125 130 135

Leu Thr Val Leu Glu Asp Leu Lys Asn Val Phe Pro Pro Glu Val
140 145 150

Ala Val Phe Glu Pro Ser Glu Ala Glu Ile Ser His Thr Gln Lys
155 160 165

Ala Thr Leu Val Cys Leu Ala Thr Gly Phe Tyr Pro Asp His Val
170 175 180

Glu Leu Ser Trp Trp Val Asn Gly Lys Glu Val His Ser Gly Val
185 190 195

Ser Thr Asp Pro Gln Pro Leu Lys Glu Gln Pro Ala Leu Asn Asp
200 205 210

Ser Arg Tyr Cys Leu Ser Ser Arg Leu Arg Val Ser Ala Thr Phe
215 220 225

Trp Gln Asn Pro Arg Asn His Phe Arg Cys Gln Val Gln Phe Tyr
230 235 240

Gly Leu Ser Glu Asn Asp Glu Trp Thr Gln Asp Arg Ala Lys Pro
245 250 255

Val Thr Gln Ile Val Ser Ala Glu Ala Trp Gly Arg Ala Asp Cys
260 265 270

Gly Phe Thr Ser Glu Ser Tyr Gln Gln Gly Val Leu Ser Ala Thr
275 280 285

Ile Leu Tyr Glu Ile Leu Leu Gly Lys Ala Thr Leu Tyr Ala Val
290 295 300

Leu Val Ser Ala Leu Val Leu Met Ala Met Val Lys Arg Lys Asp
305 310 315

Ser Arg Gly



<210> 17 

<211> 2316 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 044150 



<400> 17 

cacctcccca agatggcggc gcccgaggcc tggcgcgccc ggagttgctg gttctgtgag 60
gtagcggcgg caacgaccat ggaggccacg tcccgggagg cggcgccagc gaagagctcg 120
gcctcgggcc ccauicgctcc ccccgccctg ttcgagctgt gcgggcgggc ggtgagcgcc 180
catatggggg ttctggagag cggggtgtgg gccctcccag gcccaatact tcaaagcatc 240
ctacctctgc tcaatatata ttacttggag aggattgagg aaactgccct caagaaaggc 300
ctctcaactc aggccatctg gcgccgactc ^999a^9aac tgatgaagac aaggccttcc 360
agtttggeuia gtgtgacatg ttggcgagcc aagtttatgg aggccttttt ttcccatgtt 420
ctacgtggga ccattgatgt gtcttctgac aggcgtcttt gtgatcagcg gttctcacct 480
tctgctccag cagccacctc ctctgcctct tcttctacat cctcatacaa acgggcacca 540
gctagctcag ccccacagcc taagccccta aagcgtttca agcgagctgc agggaagaag 600
ggtgctcgca cccgtcaggg gcctggtgca gagtctgaag acctgtatga cttcgttttt 660
attgtggctg gcgagaagga ggatggcgaa gagatggaga ttggggaagt ggcttgtgga 720
gctttggatg gatcagatcc cagctgcctg gggcttccag cactggaagc ttcacaaaga 780
ttccgcagca tctccacctt ggagctattc acagttccac tctccacaga ggcagccctg 840
acactatgcc acctgctgag ctcctgggtg tcactggaga gcctcacact ctcctacaat 900
ggcctgggct ctaacatctt ccgcctgcta gacagcctgc gggccctgtc aggccaggct 960
ggatgtcgcc tccgtgccct gcatctcagt gacctgttct caccactgcc catcctggag 1020
ctgacacgtg ctatcgtgcg agcactgccc ctgctacggg tcctctctat tcgtgttgac 1080
cacccaagcc agogggacaa ccctggtgtg ccagggaatg cagggccccc tagccacata 1140
ataggcgatg aggagatacc agaaaactgc ctggagcagt tggagatggg atttccacgg 1200
ggagcccagc cagccccact gctgtgctcc gttctgaagg cctcgggttc tctgcagcag 1260
ctgtccctgg atagtgccac ctttgcctct ccccaggatt ttgggcttgt ttitgcaaaca 1320
ctcaaagagt acaacctagc cctgaaaaga ctgagcttcc atgacatgaa tctcgctgac 1380
tgtcagagcg aggtgctctt tttgctacag aatctgactc tgcautgagat taccttctcc 1440
ttctgccgtc tgtttgagaa gcgcccagcc caatttctgc ctgagatggt tgctgctatg 1500
aagggcaact ccacactgaa gggcctccgg ctgccaggga accgcctggg gaatgctggc 1560
ctgctggcct tggcagatgt tttctcagag gattcatcct cctctctctg tcagctggac 1620
atcagttcca actgcatcaa gccagatggg cttctggagt tcgccaagcg gctggagcgc 1680
tggggccgtg gagcctttgg tcacctgcgc ctcttccaaa actggctgga ccaggatgca 1740
gtcacagcca gggaagccat coggcggctc cgggctacct gccatgtggt tagcgactca 1800
tgggactcat cccaggcctt cgcagattat gttagcacca tgtgatgggg cccgtacctc 1860
acagtctcat gctcggtacc atcagcttgc aggggctgaa gcatgggctg cccagaaccc 1920
caaccaccag ttctatcttt ctctttctgt cacctttttt ctcttttttc cttcttccct 1980
tgcactgagg tcctggaggc cttgatgagg cccagcaaac aggcattctc acagctgggt 2040
ttatagtctt tgggcccctt actcagtatc ctgggaaccc tgggccagga ggttacagtg 2100
gtcatcatcui ttgctgaaga gatcccctcc cctgcccctg ggttcctgcc ttccc^cctc 2160
aagcaggcac ccaggcttta gagaagtata gggggcttct tccctgctgg gcttaccaca 2220
ctgctctcag gcctcaeUicc ctttcatacc tttattcttt tttttaacca aaaaagtttt 2280
tcttataaaa taaattttgg gcaaacaaaa aaaaaa 2316



<210> 18 
<211> 2569 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte clone 266775 



<400> 18 

gauaatacc accaacaaag aaacacagat gtggtttaca tgcuiaaatgc ggatgatttt 60
cagacggcta acgcttagga aagcgggtgc ctttgaaagg accagcgttg cccgcccggc 120
gcctcccggg cttccctgct cgtccgcgga cggggcgctg cggggccggg gggcgccggc 180
tcttcctgtg gcctccacgc tggtgccgca gccagtgcgg ttttaaatac cggagaaggt 240
ccccaagtca ggagagtctc tcggcgccac gggttcctct gggagtgcgc cctggccttg 300
ccttagggtt tcagcctcgg aggaccggtt ctgggcagtg gagaagggac ctgagttctg 360
ccttgtaaag ttaacgtttt gcgtttgttt ttgctaaaga atatccaagt tgttacaatt 420
aactgagatg atttggcaca aaagttttat ctaaagtagt tcgttgtgcc cagaaaagga 480
aaaagaggct aaattaatgg actattgtat ttttcactga ccattttcac tgttatctct 540
tatttcagtc tttatcctca tctctactca agagcataca ttaattttag gaatcctgat 600
gacatccttc tttttagaga tcgttttgat ggatatatct tccttgacag Ccuiaggccta 660
gaatatcctg cagtggtaga gtttgctcca ttccagaaga tagccaaaaa gaagctgaga 720
aaaaaagatg ccaagactgg aagcatcgaa gatgatccag aatataagaa gtttttagaa 780
acctactgtg tggaggaaga gaagaccagt gccaaccctg agactctgct gggggagatg 840
gaggcgaaga cauigagagct cattgctaga agaaccacac ctcttttgga at^tattaaa 900
aatagaaaat tagaaaagca gagaattcga gaagagaagc gagaagaacg gaggaggaga 960
gagttagaaa agaaacgttt gcgggaagag gaaaaaagaa gaagaagaga agaagaaaga 1020
tgcaaaaaaa aagagacaga taaacagaag aaaattgcag agaaagaagt aaggattaag 1080
cttcttaaga aaccagaaaa gggagaggaa ccaaccacag agaaaccaaa agaaagagga 1140
gaggagattg atactggagg tggcaagcag gaatcctgtg cccccggtgc agtcgtaaaa 1200
gccaggccca tggaaggctc gctggaggag ccccaggaga cgtcacacag cggcagtgat 1260
aaagagcaca gggatgtgga gagatctcaa gaacaagaat ctgaagcaca aagataccat 1320






WO 99/57270 PCT/US99/09191



gtggatgacg gcaggaggca cagagctcac caogagcctg aacggctttc cagaaggagt 1380
gaggatgagc agagatogggg gaaaggaccf ggccaagaca gagggaagaa ggggagccag 1440
gacagcgggg ctccggggga ggccatggag agactgggaa gagcgcagag gtgtgacgac 1500
agtccagcac ccagaaaaga gcgactggca aacaaggacc ggccagcctt gcagctgtat 1560
gatccaggag ctcgcttccg agcgcgagag tgtggcggaa acaggaggat ctgcaaggca 1620
gaaggttcgg ggactggtcc tgagaagagg gaagaggcag agtgagtcac tgcacgcacc 1680
tggcctccat ggacgagcaa gggcatccca gaaacgtgta aatgaccccg agtgtgactg 1740
ggaaggagaa cttattcctt accaggaaac tggaagctaa aaatacagag ggtgacgtag 1800
aaacacgcag aaaccattct aaagsiaagta gtgatcttgt attaaattga gcagaattct 1860
cacagatttt accattcctg ttataaacta gtatttgttg tttagccaaa acagaaaatg 1920
atttccactg gacagtagaa aaatatgtgt aaaataggga agaaagttag tattggatca 1980
gtgtgagtcc tgaagcactt tcagtgctgt gagaacgaca tccactttgg gtttcattcg 2040
tttgtaagca gaggagctgt cagtcactcg tgcttctcgg tggcctctga gccatggtgt 2100
cgagtgaaga gtagttcttg tttgttacaa cctttgtgag tcagccatgc ccgcaaagcg 2160
tgctgtgttt tagtcctggt aggeuitattt: atcagagttc acactatata aaacccaaca 2220
gcttcaacta ttgccctttc aacagttttg ccactgaccg gatagaaacg gtttcagtct 2280
ctggatggat gtgtttgtgg tttgtaacca ttacggttta aaccatggtt taagaatttg 2340
cccaaataac agaaattttg ttcgggaagg gataaactag atatagcata cagagcctgt 2400
ttttgagttt tagatacttt atttgtaaat aacttaaaat agctttctga aaccgtgcat 2460
tctgtagttt cttcctttca gtgaaattgc taaatgtcaa tgtatttttg gcactgcgat 2520
tttaaccatt tattaaataa aaattttgtt aaagaaaaaa aaaaaaaaa 2569



<210> 19 

<211> 1267 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 843183 



<400> 19 

tgaccacgcg tccggccaga gtgatcaatt tttcctgggc ctggagggcc attacctaca 60
tctggctcta ctcactggcg tgggcaggag cacctctcct gggatggaac aggtacatcc 120
tggacgtaca cggactaggc tgcactgtgg actggaaatc caaggatgcc aacgattcct 180
cctttgtgct tttcttattt cttggctgcc tggtggtgcc cctgggtgtc atagcccatt 240
gctatggcca tattctatat tccattc3gaa tgagagaggt gactcataac aacgatggag 300
acatagctac acacttgaga atgaagattc acagtgaaat tctttgaaaa tctacagatg 360
tcttggatta aggagcacta ggcaccagcc atttatacag ccagagaaca atctgataga 420
gaattctgaa gaatttgttt tagctggttc attctttagg acaaatttgt gcctgtgcca 480
atcatttcaui aattgcttca tccatagcaa ttttgaaaac tatcctcaaa tgattgsuita 540
aagaggcttc gttgtgtgga agatcttcag acaattcaag tgatcaagat tttaaaatat 600
gaaaagaaac tggccaaaat gtgcttttta atgatattca ccttcctggt ctgttggatg 660
ccttatatog tgatctgctt cttggtggtt aatggtcatg gtcacctggt cactccaaca 720
atatctattg tttcgtacct ctttgctaaa tcgaacactg tatacaatcc agtgatttat 780
gtcttcatga tcagaaagtt tcgaagatcc cttttgcagc ttctgtgcct ccgactgctg 840
aggtgccaga ggcctgctaa agacctacca gcagctggaa gtgaaatgca gatcagaccc 900
attgtgatgt cacagaaaga tggggacagg ccaaagacuia aagtgacttt caactcttct 960
tccatcattt ttatcatcac cagtgatgaa tcactgtcag ttgacgacag cgacaaaacc 1020
aatgggtcca aagttgatgt aatccaagtt cgtcctttgt aggaatgaag aatggcaacg 1080
aaagatgggg ccttaaattg gatgccactt ttggactttc atcataagaa gtgtctggaa 1140
tacccgttct atgtaatatc aacagaacct tgtggtccag caggaaatcc gaattgccca 1200
tatgctcttg ggcctcagga agaggttgaa caaaaacaaa ttcttttaat tcaacgggtg 1260
ctttaca 1267
<210> 20
<211> 1691 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 965938 



<400> 20 

ccgtagaccg tcatctgatt tggtcttttg atttgggtgt actcttccag catgtgtctt 60
aaagcttaca taagtggtat ttgcctcttt tccactaagc aatttttaaa aaaattcatc 120
ccaactttca tcacatcatc ttatgtggaa gtgccgtagt ccataccagt tccctgattt 180
aggacacaga ggttgtttct gattttcagc taacaggaat tatgccgtca aactccttgt 240
aagagtcttt tccaacatgc acgtgcatct caagagtggg ggtgacttgt agaagcggag 300
ggaagagctc agtggttttg ccttcgactc ctgactccct gcgtgcccct cattgccctg 360
acccaactgg acctgaccct gtgtccccct cccacagtcc acgctggcag tgtgggtgaa 420
gtggcccatc catcccagcg ccccagagct tgcgggccac aiagagacagt ttggctctgt 480
ctgccaccag gatcccaggg tgtgtgatga gccctcctcc gsuigaccctc atgagtggcc 540
agaagacatc accaagtggc cgatctgcac caaaaacagc gctgggaacc acaccaacca 600
tccccacatg gactgtgtca tcacaggacg gccctgctgc attggcacca agggcaggtg 660
tgagatcacc tcccgggagt actgtgactt catgaggggc tacttccatg aggaggccac 720
gctctgctct caggtgcact gcatggatga tgtgtgtggg ctcctgcctt ttctcaaccc 780
cgaggtgcct gaccagttct accgcctgtg gctatccctc ttcctgcacg ccgggatctt 840
gcactgcctg gtgtccatct gcttccagat gactgtcctg cgggacctgg agaagctggc 900
aggctggcac cgcatagcca tcatctacct gctgagtggt gtcaccggca acctggccag 960
tgccatcttc ctgccatacc gagcagaggt gggtcctgct ggctcccagt tcggcatcct 1020
ggcctgcctc ttcgtggagc tcttccagag ctgggcagat cctgggcgcg gccctggcgt 1080
gccttcttca agctgctggc tgtggtgctc ttcctcttca cctttgggct gctgccgtgg 1140
attigacaact: ttgcccacat ctcggggttc atcagtggcc tcttcctctc cttcgccttc 1200
ttgccctaca tcagctttgg caagttcgac ctgtaccgga aacgctgcca gatcatcatc 1260
tttcaggtgg tcttcctggg cctcctggct ggcctggtgg tcctcttcta cgtctatcct 1320
gtcogctgtg agtggtgtga gttcctcacc tgcatcccct tcactgacaa gttctgtgag 1380
aagtacgaac tggacgctca gctccactga gctggctgcg ggctccagcg gccgtgtgct 1440
ccagcaggcc agagccagac acgacctccc tgagcctcac aggcttacag gag^cacctg 1500
ctccatgtgg ggactggcct gtttcctgaa cacagacctc tttcttgtgc cttgttcact 1560
tctgttgaac ccctcgtact gccgggcatt tattatacta cttcctgtica taaccttcta 1620
acttgtttct tgaogaccac ctcatigtggc craataaatgg actgggagcg ttttagctgc 1680
cattaacttg a 1691



<210> 21 

<211> 1401 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte clone 1441620 



<400> 21 

gggtgacctc tggggtgagg aaactgcgac tgggagcggg acccaggcgt gcagcattcg 60
ccatgctccg ctcacgcgtg ggagactggg ctgtggggta ccggcccgga aagcaogcag 120
cctccaaagc cgccttcctc agggaaattt gcgtgacctt actgccctcc gtctacaggc 180
cttgtacctc tccaggccga tttttccaca atttaaatcc cagttcacct ggtatccagc 240






tccagcaact tagagcgttt cacgtcacgc ogggcgccag gogtcggctt gtataacctg 300
aaaacgctcc LgLL fc t tcfcg aLuLgLycag CgggWttga ttcccaccat ggecaccacc 360
cagtttcggt tatttaaatt ttgtacctgc ctagcaacag tattctcatt cctaaagaga 420
ttaatatgca gatctggcag aggacggcuia ttaagtggag accaaataac tttgccaact 480
acagttgatt attcatcagt tcctaagcag acagatgttg aagagtggac ttcctgggat 540
gaagatgcac ccaccagtgt aauigatcgaa ggagggaatg ggaatgtggc aacacaacaa 600
aattctttgg aacaactgga acctgactat tttaaggaca tgacaccaac tattaggaaa 660
actcagaaaa ttgttattaa gaagagagaui ccattgaatt ttggcatccc agatgggagc 720
acaggtttct ctagtagatt agcagctaca caagatctgc cttttattca tcagtcttct 780
gaattaggtg acttagatac ctggcaggaa aataccaatg catgggaaga agaagaagat 840
gcagcctggc aagcagaaga agttctgaga cagcagaaac tagcagacag agaaaagaga 900
gcagcogaac aacaaaggaa gaaaatggaa aaggaagcac aacggctaat gaagaaggaa 960
caaaacaaaa ttggtgtgaa actttcataa cacatgttca aattttatca tgccag^gg 1020
agaaatctca gctccacaac ccaagcaaca tttgtatgga tttaagagta ttttaagaag 1080
acatactgct tgattttaat acattgatca ggccatccag gacaccacga ttctcccaaa 1140
gtaccttgaa ctcttagtga ttgagactca aaaaaacaaa aaagacttga gacaatgttt 1200
tcttcaacac gctccaaata iiaagacattt gtttgctgta cagaaagtat cacaaatgga 1260
atatatcagt acctctcaag ctagtgtttc tagctaaata aatgggtgta tataatttta 1320
tggtggaaaa gaactgtact gtctgttatg atttccttca atgtgcataa tgataaaata 1380
gggaatgcta caaggccggg t 1401



<210> 22 

<211> 1987 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 1510911



<400> 22 

gggcgctgca ggtgtcgggg cctcaacctt gcggaccgac agccatcgat cctcgggtgg 60
cctcgaggtg gtggcagggc cgccccctgc agtccggaga cgaacgcacg gaccgggcct 120
ccggaggcag gttcggctgg aaggaaccgc tctcgcttcg tcctacactt gcgcaaatgt 180
ctccgagctt actcacatag catattggta tatcaaaatg aaatgcaagg aaccaaaaat 240
aacataattg aaggcagtaa aagtgauuitt aaataggaag atcatcagtc aaggaagacc 300
cactggagag gacagaaaat gaagcagtgt tttatcatgt gtatttcagc aggtcttctt 360
gaaatttaac taaaaatatg actgctctct cttcagagaa ctgctctttt cagtaccagt 420
tacgtcaaac aaaccagccc ctagatgtta actatctgct attcttgatc atacttggga 480
aaatattatt aaatatcctt acact^aggaa tgagaagaaa aaacacctgt caaaatttta 540
tggaatattt ttgcatttca ctagcattcg ttgatctttt acetttggta aacatttcca 600
ttatattgta tttcagggat tttgtacttt taagcattag gttcactaaa taccacatct 660
gcctatttac tcaaattatt tcctttactt atggcttttt gcattatcca gttttcctga 720
cagcttgtat agattattgc ctgaatttct ctaaaacaac caagctttca tttaagtgtc 780
aaaaattatt ttatttcttt acagtaattt taatttggat ttcagtcctt gcttatgttt 840
tgggagaccc agccatctac caaiagcctga aggcacagaa tgcttattct cgtcactgtc 900
ctttctatgt cagcattcag agttactggc tgtcattttt catggtgatg attttatttg 960
tagctttcat aacctgttgg gaagaagtta ctactttggt acaggctatc aggataactt 1020
cctatatgaa tgaaactatc ttatattttc ctttctcatc ccactccagt tatactgtga 1080
gatctaaaaa aatattctta tccaagctca ttgtctgttt tctcagtacc tggttaccat 1140
ttgtactact tcaggtaatc attgttttac ttaciagttca gattccagca tatattgaga 1200
tgaatattcc ctggttatac tttgtcaata gttttctcat tgctacagtg tattggttta 1260
attgtcacaa gcttaattta aaagacattg gattaccttt ggatccattt gtcaactgga 1320
agtgctgctt cattccactt acaattccta atcttgagca aattgaaaag cctatatcaa 1380
taatgatttg ttaatattat taattaaaag ttacagctgt cataagatca taattttatg 1440
aacagaaaga actcaggaca tattaaaeiaa taaactgaac tcuuiacaact tttgccccct 1500
gactgatagc atttcagaat gtgtcttttg aagggctatg ataccattta ttaaatiagtg 1560
ttttatttta aaaacaaaat aattccaaga agtttttata gttattcagg gacactatat 1620
tacaaatatt actttgttat taacacaaaa agtgataaga gttaacattt ggctatactg 1680
atgtttgtgt tactcaaaaa aactactgga tgcaaaetgt tatgtaaatc tgagatttca 1740
ctgacaactt taagatatca acctaaacat ttttattaaa tgttcaaatg aaagcaagaa 1800
agtaaaaatt ggtcctaaaa tgatttggca taaagttcaa tgtaagaggg aggaggagga 1860
ttaccctcca ttttaaatgg aggcaattaa ttttaaaaag gtataactcc tggttttaat 1920
gccaataatt cctgggaaga gaaggggttt gggtggccca ccttatggga actcgggacc 1980
gttgaca 1987



<210> 23 

<211> 1208 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2022379



<400> 23 

cctaaatccc gacagcttta tagagcccag gcctggcagg ctcccagaac ttgaagccac 60
cagaccccac atggaaccaa aggcctcctg tccagcagct gcacccttga tggagagaaa 120
attccatgtt cttgtgggtg tcacggggag tgtcgcagcc ctgaagttgc ctcttctggt 180
gtcaaagctt ttggacattc ctgggctgga agtagcagtg gtcacaactg agagagccaa 240
acatttctac agcccccagg acattcctgt caccctctac agcgacgctg atgaatggga 300
gatgtggaag agccgctctg acccagttct gcacattgac ctgcggaggt gggcagacct 360
cctgctggtg gctcctcttg atgccaacac tctggggaag gtggccagtg gcatctgtga 420
caacttgctt acctgcgtca tgcgggcctg ggaccgcagc aagcccctgc tcttctgccc 480
ggccatgaac accgccatgt gggagcaccc gatcacagcg cagcaggtag accagctcaa 540
ggcctttggc tatgtcgaga tcccctgtgt ggccaagaag ctggtgtgog gagatgaagg 600
tctcggggcc atggctgaag tggggaccat cgtggacaaa gtgaaagaac gt octet tec 660
agcacagtgg cttccagcag agttgacctg ggatttctgt catgggtgtc cctctgtact 720
cagaatgggt tcaggccaag tcggtgaaga tggatgttgg caaaatagga ggataccctc 780
atttgctgaa tgggggacct gctctgagcc tgcccagggg ccaggcctgc tccaggttaa 840
actggacgga aggcccaggt ctcagtttct ttcaaccagg agaggccget gcctagagcc 900
cctccccacc ttttcctgga tgggtgaggc aagccaggag agcaagcagt gttgtcctca 960
cgggaggagg actgagcgac tgggaaaact cggctctaca tctcacccag aacggctttt 1020
agaaacacca cagctggaga gtcctggctg agccttggga gtttcagctc tttggcgggg 1080
tgcccaggtg ccatgcgatc agcgcuigcct gcgagttggc aggactctga ggtttcctgc 1140
agaccatgcc atgagattga aggtgcgggg aaataaagaa aaatcaccat ttagaaaaaa 1200
aaaaaaaa 1208



<210> 24 

<211> 2030 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2024312 



<400> 24 

cagagaagga gtagcgcgtt cgtgcgtcct agttccagta cagcgtggag ggtttaggca 60 
gcgtgttctg attctttgcg ggacggcgag cgcatttgtg ctttgcccgc cgcggcctag 120
gaggcctttt gaggccgcgt agtcggtgtt tttgaactga ctctacagct tctggcaggc 180
cgtgcggcgc cctgacccgg cctcaccatg ttggtgctgt ttgaaacgtc tgtgggttac 240
gccatcttta aggttctaaa tgagaagaaa cttcaagagg ttgatagttt atggaaagaa 300
tttgaaactc cagagaaagc aaacaaaata gtaaagctaa aacattttga gaaatttcag 360
gatacagcag aagcattagc agcattcaca gctctgatgg agggcaaaat caataagcag 420
ctgaaaaaag ttctgcuigaa aatagtaaaa gaagcccatg aaccgctggc agtagctgat 480
gctaaactag gaggggtcat aaaggaaaag ctgaatctca gttgtatcca tagtcctgtt 540
gttaatgaac ttatgagagg aattcgttca caaatggatg gattaatccc tggggtagaa 600
ccacgtgaaa tggcagctat gtgtcttgga ttggctcaca gcctgtctcg atatagattg 660
aagtttagcg ctgataaagt agacacaatg attgttcagg caatttcctt gttagatgac 720
ttggataaag aactaaacaa ctacattatg cgatgtagag aatggtatgg ctggcatttc 780
cctgaattag gaaaaattat ttcagataat ttaacatact gcaagtgttt acagaaagtt 840
ggcgatagga agaactatgc ctctgccaag ctttctgagt tgctgccaga agaagttgaa 900
gcagaagtga aagcagctgc agagatatca atgggaacag aggtttcaga agaagatatt 960
tgcaatattc tgcatctttg cacccaggtg attgaaatct ctgaatatcg aacccagctc 1020
tatgaatatc tacaaaatcg aatgatggcc attgcaccca atgttacagt catggttggg 1080
gaattagttg gagcacggct tattgctcat gcaggttctc ttttaaattt ggccaagcat 1140
gcagcttcta ccgttcagat tcttggagct gaaaaggcac ttttcagagc cctcaaatct 1200
agacgggata cccctaagta tggtctcatt tatcatgctt cactcgtggg ccagacaagt 1260
cccaaacaca aaggaaagat ttctcgaatg ctggcagcca aaaccgtttt ggctatecgt 1320
tatgatgctt ttggtgagga ttcaagttct gcaatgggag ttgagaacag agccaaatta 1380
gaggccaggt tgagaacttt ggaagacaga gggataagaa aaataagtgg aacaggaaaa 1440
gcattagcaa aaacagaaaa atatgaacac aaaagtgaag tgetagactta cgatccttct 1500
ggtgactcca cacttccaac ctgttctaaa aaacgcaaaa tagaacaggt agataaagag 1560
gatgaaatta ctgaaaagaa agccaaaaaa gccaagatta aagttaaagt tgaagaagag 1620
gaagaagaaa aagtggcaga agaagaagaa acatctgtga agaagaagaa gaaaaggggt 1680
aaaaagaaac acattaagga agaaccactt tctgaggaag aaccatgtac cagcacagca 1740
attgctagtc cagagaaaaa gaagaaaaag aaaaaaaaga gagagaacga ggattaacag 1800
aaaggaatta cgattatatc acccggacac acatcatgct taagattcaa ctgggagcat 1860
accagggatg ctctctaacg t:cuitcaaggg aaggttcagt aagacaaagt gatttatcat 1920
ctataacttc aaacctattt gtcttgacat caactctgtt aaccttatgt catcatttct 1980
tagagtcttt gatatacaaa taaaattttc tttgtatttt aaaaaaaaaa 2030



<210> 25 

<211> 1919 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2057886 



<400> 25 

gtccgggtcc gctgcctggc gctgcgggcg gcgggccatg gtggtttgga ttgaaccggg 60
cccggccgga gcgccgagtc ggagggggtg gcagtgagcg gcggcagagg ctacggggct 120
cggtttggct gactggggag tcggcaggcg gcaggtcttt tgtggggatg agccagggtg 180
caaggagagt acaatactcc agttaccgaa ttgaaaccat ccctgcagtg gagcagcctc 240
ctccagtttc tgttgggttt tgagctacct gttaaataag tcagtgggat tgtcaaggac 300
aaagccctcc ctggctgcct cagggcaaaa tcaggaacca tgcgaggcca gcggagcctg 360
ctgctgggcc cggcccgcct ctgcctccgc ctccttctgc tgctgggtta caggcgccgc 420
tgtccacctc tactccgggg tctagtacag cgctggcgct acggcaaggt ctgcctgogc 480
tccctgctct acaactcctt tgggggcagt gacaccgctg ttgatgctgc ctttgagcct 540
gtctactggc tggtagacaa cgtgatccgc tggtttggag tggtgttcgt ggtcctggtg 600
atcgtgctga caggctccat tgtagctatc gcctacctgt gtgtcctgcc tctcatcctc 660
ogsuicctact cagtgccacg actctgct9g catttcttct atagccactg gaatctgatc 720
ctgattgtct tccactacta ccaggccatc accaetccgc ctgggtaccc accccagggc 780
aggaatgata tcgccaccgt ctccatctgt aagaagtgca tttaccccaa gccagcccga 840
acacaccact gcagcatctg caacaggtgt gtgctgaaga tggatcacca ctgcccctgg 900
ctaaacaatt gtgtgggcca ctataaccat oggtacttct tctctttctg ctttttcatg 960
actctgggct gtgtctactg cagctatgga agttgggacc ttttccggga ggcttatgct 1020
gccattgaga cttatcacca gaccccacca cccaccttct cctttcgaga aaggatgact 1080
cacaagagtc ttgtctacct ctggttcctg tgcagttctg tggcacttgc cctgggtgcc 1140
ctaactgtat ggcatgctgt tctcatcagt cgaggtgaga ctagcatcga aaggcacatc 1200
aacaagaagg agagacgtcg gctacaggcc aagggcagag tatttaggaa tccttacaac 1260
tacggctgct tggacaactg gaaggtattc ctgggtgtgg atacaggaag gcactggctt 1320
actcgggtgc tcttaccttc tagtcacttg ccccatggga atggaatgag ctgggagccc 1380
cctccctggg tgactgctca ctcagcctct gtgatggcag tgtgagctgg actgtgtcag 1440
ccacgactcg agcactcatt ctgctcccta tgttatttca agggcctcca agggcagctt 1500
ttctcagaat ccttgatcaa aaagagccag tgggcctgcc ttagggtacc atgcaggaca 1560
attcaaggac cagccttttt accactgcag aagaaagaca caatgtggag aaatcttagg 1620
actgacatcc ctttactcag gcaaacagaa gttccaaccc cagactaggg gtcaggcagc 1680
tagctaccta ccttgcccag tgctgacccg gacctcctcc aggatacagc actggagttg 1740
gccaccacct cttctacttg ctgtctgaaa aaacacctga ctagtacagc tgagatcttg 1800
gcttctcaac agggcaaaga taccaggcct gctgctgagg tcactgccac ttctcacatg 1860
ctgcttaagg gagcacaaat aaaggtattc gatttttaaa aaaaaaaaaa aaaaaaaaa 1919



<210> 26 

<211> 1943 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte clone 2121924 



<400> 26 

aggagcgagg agcagctcgg gagagccgga gcggtagcag cagcagcggc ggcggcggcg 60
gcggcgaggc tcggcgccct cttccctgca aaccatgttt gccaaaggca aaggctcggc 120
ggtgccctcg gatgggcagg ctcgggaauia gttagcttta tacgtctacg aatatttact 180
gcacgtagga gcacagaaat ctgcacagac cttcttatcg gagattcgat gggaaaaaaa 240
catcaogttg ggagaaccgc ctgggttttt gcactcgtgg tggtgtgtat tttgggacct 300
ttactgtgca gctcctgaaa ggagagacac ttgtgaacat tcaagtgaag caaaagcctt 360
tcatgattat agtgcagcag ctgccccgag ccccgtgctt ggcaacattc cccccaacga 420
tgggatgccg ggaggcccca tcccgccagg tttctttcag ccttttatgt caccgcgata 480
cgcaggcggc cccaggcccc cgatcagaat gggasiaccag cctccgggag gagttcctgg 540
gacacagcca ttgctgccca attctatgga tcccacaoga caacaaggcc accccaacat 600
gggaggatca atgcagagaa tgaaccctcc ccgaggcatg gggcccatgg gtcccggccc 660
acagaattac ggcagtggca tgagaccacc acceaactcc ctcggccccg ccatgcccgg 720
gattaacatg ggcccgggag ctggcagacc ctggcccaat cctaacagtg ctaactcaat 780
tccatactcc tcctcatcac ctggtaccta tgtgggaccc cctggtggtg gcggticctcc 840
aggaacaccc attatgccca gtcccgcaga ttcaacaaat tccagtgaca acatctacac 900
aatgattaat ccagtgccgc ctggaggcag ccggtccaac ttcccgatgg gtcccggctc 960
ggacgg Lccg atgggcggca tgggtggca^ ggagccacac cacatgaatg gatcattagg 1020
gtcaggcgac atagacggac ttccaaaaaa ttctcctaac aacataagtg gcattagcaa 1080
tcctccaggc acccctcgag atgacggcga gctaggaggg aacttcctcc actcctttca 1140
gaacgacaat tattctccaa gcatgacgat gagtgtgtga tccccccttc tccgagacgc 1200
tgagagagca ggcattgcag gcgggaagat gccagaaatt atgcaagaag tgaggtgtca 1260
ttatccagga gctggtgggg agggcatctc cctgctcccc tcaaccccct cccaccccat 1320
ccacgccccc tacctttccc aattttagtt tcatgcaata aaaaggccaa actttttatt 1380



ccataaaaca agaaggacaa aactctcaaa aatgtatttc aagtcagtga ccagaaaaat 1440 
cccacccctt gccctttccc caaaggacct tttctgtaca tgacactttt ttgttgtttt 1500 
ttgtttgggg ttttaccatt gttgggattt ttttatttgt tttcaggggg gttttttggg 1560 
ggaaaatttt tttaaatgga agcttctagc aagcccccca ccccaatcaa cctctatgct 1620 
ttcttcttaa aaaaaaaaaa asiagggaaaa ggaaaaaaaa aaaagggaaa accagaagcc 1680 
ctgctgtctg tctggggccc aagccctttc accagaaaag ctagtctagg tgtgagagcc 1740 
cacattgtct gtagccatca aaaataataa taataaactg ggacagttta ccaatccaaa 1800 
aaaaaaaaaa a£i£uuiggggc gggccgcttc tagaaggatt caaagcttac cgtaacgcgg 1860 
tgcaatgcga cggtccaaag ccccttctta aaggggtccc ctaaaattcc aaattcacct 1920 
ggcccgtcgg tttttacaac cgg 1943 



<210> 27 

<211> 1174 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte clone 2122815 



<400> 27 

ggcaacgcag ctcgcggcgg gcgctgggcg cgggatccga ctctagtcgt aatggaggcg 60 
ggcggctttc tggactcgct catttacgga gcatgcgtgg tcttcaccct tggcatgttc 120 
tccgccggcc tctcggacct caggcacatg cgaatgaccc ggagtgtgga caacgtccag 180 
ttcctgccct ttctcaccac ggaagtcaac gtgttgtgct cctacagact gcaaccctgc 240 
taggggtcct tctcctgggt tatggctact tttggctcct ggtacccaac cctgaggccc 300 
ggcttcagca gttgggcctc ttctgcagtg tcttcaccat cagcatgtac ctctcaccac 360 
tggctgactt ggctaaggtg atccaaacta aatcaaccca atgtctctcc tacccactca 420 
ccattgctac ccttctcacc tctgcctcct ggtgcctcta tgggtttcga ctcagagatc 480 
cctatatcat ggtgtccaac tttccaggaa tcgtcaccag ctttatccgc ttctggcttt 540 
tctggaagta ccccaggagc aagacaggaa ctactggctc ctgcaaacct gaggctgctc 600 
atctgaccac tgggcacctt agtgccaacc tgaaccaaag agacctcctt gtttcagctg 660 
ggcctgctgt ccagcttccc aggtgcagtg ggttgtggga acaagagatg actttgagga 720 
taaaaggacc aaagaaaaag ctttacttag atgattgatt ggggcctagg agatgaaatc 780 
actttttatt ttttagagat tttttttttt aattttggag gttggggtgc aatctttaga 840 
atatgcctta aaaggccggg cgcggtggct cacgcctgta atcccagcac tttgggaggc 900 
caaggtgggc ggatcgcctg aggtcaggag ttcaagacca acctgactaa catggtgaaa 960 
ccccatctct actaaaaata caaaattagc caggcatgat ggcacatgcc tgtaatccca 1020 
gatacttggg aggctgaggc aggagaattg cttgaaccca ggaggtggag gttgcagtga 1080 
gctgagatcg tgccattgtg atatgaatat gccttatatg ctgatatgaa tatgccttaa 1140 
aataaagtgt tccccacccc tgccaaaaaa aaaa 1174 



<210> 28 
<211> 1374 

<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte clone 2132179 

<400> 28 

cggtccaggc ctctggcgaa catggcgctt gtcccctgcc aggtgctgcg gatggcaatc 60 



ctgctgtcct actgctctat cctgtgtaac tacaaggcca tcgaaatgcc ctcacaccag 120 
acctacggag yyuycLyyaa are eg tgae g ttcattgatc tggttatcca ggctgtcttt 180 
tttggcatct gtgtgctgac tgatctttcc agtcttctga ctcgaggaag tgggaaccag 240 
gagcaagaga ggcagctcaa gaagctcatc tctctccggg actggatgtt agctgtgttg 300 
gcctttcctg ttggggtttt tgttgtagca gtgttctgga tcatttatgc ctatgacaga 360 
gagatgatat acccgaagct gctggataat tttatcccag ggtggctgaa tcacggaatg 420 
cacacgacgg ttctgccctt tatattaatc gagatgagga catcgcacca tcagtatccc 480 
agcaggagca gcggacttac cgccatatgt accttctctg ttggctatat attatgggtg 540 
tgctgggtgc atcatgtaac tggcatgtgg gtgtaccctt tcctggaaca cattggccca 600 
ggagccagaa tcatcttctt tgggtctaca accatcttaa tgaacttcct gtacctgctg 660 
ggagaagttc tgaacaacta tatctgggat acacagaaaa gtctggaaga agagaaagaa 720 
aagcctaaat tggaatgaga tccaagtcta aacgcaagag ctagattgag ccgccattga 780 
agactccttc ccctcgggca ttggcagtgg gggagaaaag gcttcaaagg aacttggtgg 840 
catcagcacc cccctccccc aatgaggaca ccttttatat ataaatatgt ataaacatag 900 
aatacagttg tttccaaaag aactcaccct cactgtgtgt taaagaattc ttcccaaagt 960 
cattactgat aataacattt tttccttttc tagttttaaa accagaattg gaccttggat 1020 
ttttattttg gcaattgtaa ctccatctaa tcaagaaaga ataaaagttt attgcacttc 1080 
tttttgagaa atatgttaaa gtcetaagggg catatataga gtaaggcttt tgtgtattta 1140 
atccteuuigg tggctgtaat catgaiaccta ggccaccatg gggacctgag agggaagggg 1200 
acagatgttt ctcattgcat aatgtcacag ttgcctcaaa tgagcaccat ttgtaataat 1260 
gatgtcaatt tcatgaaaag cctgagtgta ttgcatctct tgatttaatc atgtgaaact 1320 
tttcctagat gcaaatgctg actaataaag acaaagccac cctgaaaaaa aaaa 1374 



<210> 29 

<211> 1498 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte clone 2326441 



<400> 29 

cacagctgtg atcagtcccc aggatcccac gcttctcatc ggctcctccc tgctggccac 60 
ctgctcagtg cacggagacc caccaggagc caccgccgag ggcctctact ggaccctcaa 120 
tgggcgccgc ctgccccctg agctctcccg tgtactcaac gcctccacct tggctctggc 180 
cctggccaac ctcaatgggt ccaggcagcg gtcgggggac aacctcgtgt gccacgcccg 240 
tgacggcagc atcctggctg gctcctgcct ctatgttggc ctgcccccag agaaacccgt 300 
caacatcagc tgctggtcca agaatcatga aggacttgac ctgccgctgg acgccagggg 360 
cccacgggga gaccttcctc cacaccaact actccctcaa gtacaagctt aggtggtatg 420 
gccaggacaa cacatgtgag gagtaccaca cagtggggcc ccactcctgc cacatcccca 480 
aggacctggc tctctttacg ccctatgaga tctgggtgga ggccaccaac cgcctgggct 540 
ctgcccgctc cgatgtactc acgctggata tcctggatgt ggtgaccacg gaccccccgc 600 
ccgacgtgca cgtgagccgc gtcgggggcc tggaggacca gctgagcgtg cgctgggtgt 660 
cgccacccgc cctcaaggat tccctctttc aagccaaata ccagatccgc taccgagtgg 720 
aggacagtgt ggactggaag gtggtggacg atgtgagcaa ccagacctcc tgccgcctgg 780 
ccggcctgaa acccggcacc gtgtacttcg tgcaagtgcg ctgcaacccc tttggcatct 840 
atggctccaa gaaagccggg atctggagtg agtggagcca ccccacagcc gcctccactc 900 
cccgcagtga gcgcccgggc ccgggcggcg gggcgtgcga accgcggggc ggagagccga 960 
gctcggggcc ggtgcggcgc gagctcaagc agttcctggg ctggctcaag aagcacgcgt 1020 
actgctccaa cctcagcttc cgcctctacg accagtggcg agcctggatg cagaagtcgc 1080 
acaagacccg caaccagcac aggacgaggg gatcctgccc tcgggcagac ggggcacggc 1140 
gagaggtcct gccagataag ctgtaggggc tcaggccacc ctccctgcca cgtggagacg 1200 
cagaggccga acccaaactg gggccacctc tgtaccctca cttcagggca cctgagccac 1260 
cctcagcagg agctggggtg gcccctgagc tccaacggcc ataacagctc tgactcccac 1320 
gtgaggccac ctttgggtgc accccagtgg gtgtgtgtgt gtgtgtgagg gttggttgag 1380 
ttgcctagaa cccctgccag ggctgggggt gagaagggga gtcattactc cccattacct 1440 
agggcccctc caaaagagtc cttttaaata aatgagctat ttaggtgcaa aaaaaaaa 1498 



<210> 30 

<211> 1440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> 9, 43, 58, 68, 1430-1440 

<223> a or g or c or t, unknown, or other 

<220> 

<221> misc_feature 

<223> Incyte clone 2825826 



<400> 30 

cttgtggant aaccgttatt acccgccttt tgattgagct tantccccgt ttgccggnag 60 
ccggacgnat tcggcacagg cgtccgctgc agtccgccgg cgagggagtt acgcacgtcc 120 
tgattctcct ggagtctcca gcccgcccag tggccgcagt cacccaggtc cagaggcggc 180 
ggtatcacag gctctccgac atgtctatgc tggctgaacg tcggcggaag cagaagtggg 240 
ctgtggatcc tcagaacact gcctggagta atgacgattc caagtttggc cagcggatgc 300 
tagagaagat ggggtggtct aaaggaaagg gtttaggggc tcaggagcaa ggagccacag 360 
atcatattaa agttcaagtg aaaaataacc acctgggact cggagctacc atcaataatg 420 
aagacaactg gattgcccat caggatgatt ttaaccagct tctagccgaa ctgaacactt 480 
gccatgggca ggaaaccaca gattcctcgg acaagaagga aaagaaatct tttagccttg 540 
aggaaaagtc caaaatctcc aaaaaccgtg ttcactatat gaaattcaca aaagggaagg 600 
atctgtcatc tcggagcaaa acagatcttg actgcatttt tgggaaaaga cagagtaaga 660 
agactcccga gggcgatgcc agtccctcca ctccagagga gaacgaaacc acgacaacca 720 
gcgccttcac catccaggag tactttgcca agcggatggc agcactgaag aacaagcccc 780 
aggttccagt tccagggtct gacatttctg agacgcaggt ggaacgtaaa agggggaaga 840 
aaagaaataa agaggccaca ggtaaagatg tggaaagtta cctccagcct aaggccaaga 900 
ggcacacgga gggaaagccc gagagggccg aggcccagga gcgagtggcc aagaagaaga 960 
gcgcgccagc agaagagcag ctcagaggcc cctgctggga ccagagttcc aaggcctctg 1020 
ctcaggatgc aggggaccat gtgcagccgc ctgagggccg ggacttcacc ctgaagccca 1080 
aaaagaggag agggaagaaa aagctgcaaa aaccagtaga gatagcagag gacgctacac 1140 
tagcagaaac gctagtgaNa aaagaagaag aagaaagatt ccaaatgaat ccttcccagc 1200 
cggggccttc cgaccactca gctgtcaggg cactgcgggg gcagacacct ctggcctgaa 1260 
gtcacagcag agttcacccc agagcgtctg ggcgcatctt gtggcatgcc catgggctgc 1320 
cgagtcctgc cctctcgcca catttccccc aagttacatt cccaggagga cctttttaat 1380 
gttctcaatc gtggctctca gacacaaata aatttttttg teiaactctgn nnnnnnnnnn 1440 



<210> 31 

<211> 1251 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte clone 2936050 
<400> 31 

gcctcgcjLac l:ofacctefcgc agagccgg^^ ggagcccaz^ gacg^ccagc gaagstcgga 60 
gcagcgatgg acggtcgggt gcagctgata aaggccctcc tggccttgcc gatccggccc 120 
gcggcgcgtc gctggaggaa cccgattccc tttcccgaga cgtttgacgg cgataccgac 180 
cgactcccgg agttcatcgt gcagacgggc tcctacatgt tcgtggacga gaacacgttc 240 
tccagcgacg ccctgaaggt gacgttcctc atcacccgcc tcacggggcc cgccctgcag 300 
tgggtgatcc cctacatcaa gaaggagagc cccctcctca atgattaccg gggctttctg 360 
gccgagatga agcgagtctt tggatgggag gaggacgagg acttctaggc cgggagaccc 420 
tcgggcctgg gggcgggtgc tctgggaaga gttcgctgtg ttactggcca ccgctagggt 480 
ctccacaggc gccctccctc ccccgcaccc tccccctcga gccgccgcga tgtcccctgc 540 
gctcctgtcc cctcccgcgt agtgcttgcc tttgttccag gaatagcgct ccaggctcct 600 
gctgcagctc ctgggccgca ctcttgagcg cgctgcctcc gccctctttt gccagcccca 660 
gcccctccat gcacatttgg acgctgtcct gcacttcagg tgcaagctgg gctcctgtta 720 
catactggac agaccaccca cagccgctgc tgccaaccca cctccactcc tccagactgc 780 
cagccaacta catcattctg cccacagacc tacgctacca ctgccatcgc catccaccgc 840 
atctcaccaa cagactgtgg ctcctagtga tgtggactca cctcggaggt atccgagctg 900 
gacacagccc ctggacagtg atccagacag ctggccgtcc cccaaggagc ctgtcacctt 960 
cagcgagacc catttcctcc ccatccccag agacctcttg tgttcctgcc acatagctgc 1020 
cagggcttaa gtgtgcctgg caaccaaatc gaatctctca ttttctcctg tggaccagtt 1080 
agttttgcct agaatcctgt tttcttctga atttgcagct gtctctctga tgggtgcctt 1140 
ttgttcaaca cagtaagccc tgctcccttc cctgctctaa tacactacct gtacaaaggt 1200 
tttttcctta tttttaataa atgtcagaca ctattaaata gaaaaaaaaa a 1251 



<210> 32 
<211> 1211 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte clone 3428945 



<400> 32 

gatctggtaa agcccccatc ctggtctgac actgtcatgg gtaccagtct cctatgctgg 60 
gtggtcctgg gtttcctagg gacagattct gtttccacag atcacacagg tgctggagtc 120 
tcccagtctc ccaggtacaa agtcacaaag aggggacagg atgtaactct caggtgtgat 180 
ccaatttcga gtcatgcaac cctttattgg tatcaacagg ccctggggca gggcccagag 240 
tttctgactt acttcaatta tgaagctcaa ccagacaaac cagggctgcc cagtgatcgg 300 
ttctctgcag agaggcctga gggatccatc tccactctga cgattcagcg cacagagcag 360 
cgggactcag ccatgtatcg ctgtgctagc agcttagcga cagggggaac cggggagctg 420 
ttttttggag aaggctctag gctgaccgtg ctagaggacc tgaaaaacgt gttcccaccc 480 
gaggtcgctg tgtttgagcc atcagaagca gagatctccc acacccaaaa ggccacactg 540 
gtgtgcctgg ccacaggctt ctaccccgac cacgtggagc tgagctggtg ggtgaatggg 600 
aaggaggtgc acagtggggt cagcacagac ccgcagcccc tcaaggagca gcccgccctc 660 
aatgactcca gatactgcct gagcagccgc ctgagggtct cggccacctt ctggcagaac 720 
ccccgcaacc acttccgctg tcaagtccag ttctacgggc tctcggagaa tgacgagtgg 780 
acccaggata gggccaaacc tgtcacccag atcgtcagcg ccgaggcctg gggtagagca 840 
gactgtggct tcacctccga gtcttaccag caaggggtcc tgtctgccac catcctctat 900 
gagatcttgc tagggaaggc caccttgtat gccgtgctgg tcagtgccct cgtgctgatg 960 
gccatggtca agagaaagga ttccagaggc tagctccaaa accatcccag gtcattcttc 1020 
atcctcaccc aggattctcc tgtacctgct cccaatctgt gttcctaaaa gtgattctca 1080 
ctctgcttct catctcctac ttacatg£iat acttctctct tttttctgtt tccctgaaga 1140 
ttgagctccc aacccccaag tacgaaatag gctaaaccaa taaaaaattg tgtgttgggc 1200 
ctggttgcat t 1211 